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(54) Title: PLANTS HAVING MODIFIED GROWTH CHARACTERISTICS AND A METHOD FOR MAKING THE SAME 



^5 (57) Abstract: The present invention concerns a method for modifying the growth characteristics of plants by modifying expression 
in a plant of a nucleic acid sequence encoding a 2xC2H2 zinc finger protein and/or modifying level and/or activity in a plant of a 
2xC2H2 zinc finger protein. The invention also relates to transgenic plants having modified growth characteristics, which plants 
have modified expression of a nucleic acid encoding a 2xC2H2 zinc finger protein. For example yield of crop plants are improved 
by the methods of the present invention. 
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Plants having modified growth characteristics and a method for 

making the same 

The present invention concerns a method for modifying plant growth characteristics. More 
specifically, the present invention concerns a method for modifying the growth characteristics 
of a plant by modifying expression of a nucleic acid encoding a zinc finger protein and/or by 
modifying the level and/or activity of a zinc finger protein in a plant, which zinc finger protein 
has two zinc finger domains of the type C2H2 (2xC2H2). The present invention also concerns 
plants having modified expression of a nucleic acid encoding a 2xC2H2 zinc finger protein 
and/or modified levels and/or activity of a 2xC2H2 zinc finger protein, which plants have 
modified growth characteristics relative to corresponding wild type plants. 

Given the ever-increasing world population, it remains a major goal of agricultural research to 
improve the efficiency of agriculture. Conventional means for crop and horticultural 
improvements utilise selective breeding techniques to identify plants having desirable 
characteristics. However, such selective breeding techniques have several drawbacks, namely 
that these techniques are typically labour intensive and result in plants that often contain 
heterogeneous genetic components that may not always result in the desirable trait being 
passed on from parent plants. Advances in molecular biology have allowed mankind to modify 
the germplasm of animals and plants in a specific and controlled way. Genetic engineering of 
plants entails the isolation and manipulation of genetic material (typically in the form of DNA or 
RNA) and the subsequent introduction of that genetic material into a plant. Such technology 
has led to the development of plants having various improved economic, agronomic or 
horticultural traits. A trait or growth characteristic of particular economic interest is high yield. 
Yield is normally defined as the measurable produce of economic value from a crop. This may 
be defined in terms of quantity and/or quality. Other important growth characteristics include 
modified architecture, modified growth rate, among others. 

The ability to influence one or more of the abovementioned growth characteristics, would have 
many applications in areas such as crop enhancement, plant breeding, production of 
ornamental plants, arboriculture, horticulture, forestry, production, of algae or plants (for 
example for use as bioreactors, for the production of substances such as pharmaceuticals, 
antibodies, or vaccines, or for the byconversion of organic waste or for use as fuel in the case 
of high-yielding algae and plants). 
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The term "zinc finger" describes a nucleic acid-binding domain in a protein that is folded 
around a tetrahedrally coordinated Zinc ion (Miller et al. 1985. EMBO, 4, 1609-1614). The 
amino acids that coordinate the zinc ion, are always cystein or histidine residues, however, 
diversity occurs in the sequence and length of the zinc finger domain. Zinc finger proteins may 
5 contain several zinc finger domains of the same or different type. Further variability is 
encountered in nature by association of zinc finger domains with other domains. For example, 
some zinc finger proteins are found in association with ring finger or coil-coil domains, to form 
a so-called tripartite domain. There are several types of zinc fingers, such as C2H2, C2HC, 
C2C2. C2H2 is known as the classical zinc finger domain. There are typically two criteria used 

10 to classify zinc finger proteins, the first being the type of zinc finger and the second being the 
number of zinc fingers present in the protein. Zinc finger proteins having a single C2H2 domain 
have been characterised, for example Superman from Arabidopsis and Ramosa I from maize. 
A well-characterised zinc finger protein having three C2H2 domains is the Indeterminate 1 
protein from Maize. Although the first report of this gene (Colasanti et al., Cell. 1998 May 

15 15;93(4):593-603) only mentions the presence of two zinc finger domains, a more 
sophisticated analysis, using pFAM domain search, revealed the presence of three C2H2 zinc 
finger domains. Also known are zinc-finger proteins having only two C2H2 domains, for 
example ZAT10 (STZ )and SCOF-1. This subset of plant zinc finger proteins having two C2H2 
domains have been implicated in plant responses to various stresses (Sakamoto et a/., Gene 

20 248 (1-2) 23-32 (2000)). Both STZ and SCOF-1 have been used to enhance abiotic stress 
tolerance. When over-expressed, STZ has been reported to increase salt tolerance in yeast 
(Lippuner et a/., J Biol Chem. 271 (22) 12859-12866 (1996)) and over-expression of the 
SCOF-1 gene under control of the CaMV 35 S promoter has been reported to enhance cold 
tolerance in Arabidopsis thaliana (Kim et a/., Plant J. 25 (3) 247-259 (2001)). Reports of plants 

25 having modified expression of a zinc finger encoding gene (whether the zinc finger gene is 
mutated, over-expressed or otherwise) describe plants having abnormal growth 
characteristics, none of which (with the exception of cold stress tolerance in transgenic plants 
expressing SCOF-1) are desirable for crops or describe effects that are only detectable under 
particular stress conditions. 

30 

It has now been found that modifying expression in a plant of a 2xC2H2 zinc finger gene 
and/or modifying the level and/or activity in a plant of a 2xC2H2 zinc finger protein gives plants 
having modified growth characteristics. In particular it has been found that introduction into a 
plant of a 2xC2H2 zinc finger nucleic acid gives plants modified growth characteristics, such as 
35 increased yield, modified leaf architecture and altered cycle time, each relative to wild type 
plants. 
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Therefore according to one embodiment of the present invention there is provided a method 
for modifying the growth characteristics of a plant, comprising modifying expression in a plant 
of a nucleic acid encoding a 2xC2H2 zinc finger protein and/or modifying level and/or activity in 
a plant of a 2xC2H2 zinc finger protein. 

5 

The term "modifying" as used herein is taken to mean enhancing, decreasing and/or changing 
in place and/or time. Modifying expression of a nucleic acid encoding a 2xC2H2 zinc finger 
protein or modifying the level and/or activity of the 2xC2H2 zinc finger protein itself 
encompasses altered expression of a gene and/or altered level and/or activity of a gene 

10 product, namely a polypeptide, in specific cells or tissues, when compared to expression, level 
and/or activity of a 2xC2H2 zinc finger gene or protein in corresponding wild-type plants. The 
modified gene expression may result from modified expression of an endogenous 2xC2H2 zinc 
finger gene and/or may result from modified expression of a 2xC2H2 zinc finger gene 
previously introduced into a plant. Similarly, modified levels and/or activity of a 2xC2H2 zinc 

15 finger protein may be due to modified expression of an endogenous 2xC2H2 zinc finger 
nucleic acid/gene and/or due to modified expression of a 2xC2H2 zinc finger nucleic acid/gene 
previously introduced into a plant. Modified expression of a gene/nucleic acid and/or modified 
level and/or activity of a gene product/protein may be effected, for example, by chemical 
means and/or recombinant means. 

20 

Therefore there is provided by the present invention, a method for modifying the growth 
characteristics of a plant, comprising modifying expression, level and/or activity of a 2xC2H2 
zinc finger gene or protein by recombinant means and/or by chemical means. 

25 Advantageously, modifying expression of a nucleic acid encoding a 2xC2H2 zinc finger protein 
and/or modifying level and/or activity of the 2xC2H2 zinc finger protein itself may be effected 
by chemical means, i.e. by exogenous application of one or more compounds or elements 
capable of modifying activity of the 2xC2H2 zinc finger protein and/or capable of modifying 
expression of a 2xC2H2 zinc finger gene (which may be either an endogenous gene or a 

30 transgene introduced into a plant). The term "exogenous application" as defined herein is 
taken to mean the contacting or administering of a suitable compound or element to a plant. 
The compound or element may be exogenously applied to a plant in a form suitable for plant 
uptake (such as through application to the soil for uptake via the roots, or in the case of some 
plants by applying directly to the leaves, for example by spraying). The exogenous application 

35 may take place on wild-type plants or on transgenic plants that have previously been 
transformed with a 2xC2H2 zinc finger nucleic acid/gene or other transgene. 
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Suitable compounds or elements for exogenous application include 2xC2H2 zinc finger 
proteins or 2xC2H2 zinc finger nucleic acids. Alternatively, exogenous application of 
compounds or elements capable of modifying levels of factors that directly or indirectly activate 
or inactivate a 2xC2H2 zinc finger protein will also be suitable in practising the invention. Also 
5 included are antibodies that can recognise or mimic the function of 2xC2H2 zinc finger 
proteins. Such antibodies may comprise "plantibodies", single chain antibodies, IgG antibodies 
and heavy chain camel antibodies, as well as fragments thereof. 

Additionally or alternatively, the resultant effect may also be achieved by the exogenous 
10 application of an interacting protein or activator or an inhibitor of a 2xC2H2 zinc finger 
gene/gene product. Additionally or alternatively, the compound or element may be a mutagenic 
substance, such as a chemical selected from any one or more of: N-nitroso-N-ethylurea, 
ethylene imine, ethyl methanesulphonate and diethyl sulphate. Mutagenesis may also be 
achieved by exposure to ionising radiation, such as X-rays or gamma-rays or ultraviolet light. 
15 Methods for introducing mutations and for testing the effect of mutations (such as by 
monitoring gene expression and/or protein activity) are well known in the art. 

Additionally or alternatively, and according to a preferred embodiment of the present invention, 
modifying expression of a nucleic acid encoding a 2xC2H2 zinc finger protein and/or modifying 
20 level and/or activity of the 2xC2H2 zinc finger protein may be effected by recombinant means. 
Such recombinant means may comprise a direct and/or indirect approach for modifying 
expression of a nucleic acid and/or level and/or activity of a protein. 

For example, an indirect approach may comprise introducing, into a plant, a nucleic acid 
25 capable of modifying expression of the gene in question (a gene encoding a 2xC2H2 zinc 
finger protein) and or capable of modifying the level and/or activity of the protein in question (a 
2xC2H2 zinc finger protein). Examples of such nucleic acids to be introduced into a plant 
include nucleic acids encoding transcription factors or activators or inhibitors that bind to the 
promoter of a 2xC2H2 zinc finger gene or that interact with a 2xC2H2 zinc finger protein. 
30 Methods to test these types of interactions and methods for isolating nucleic acids encoding 
such interactors include yeast one-hybrid or yeast two-hybrid screens in which the 2xC2H2 
zinc finger gene/protein is used as bait. One example of such a transcription regulator is LOS2, 
described as a transcription regulator for the STZ gene. Therefore, the method of the invention 
may also be performed using LOS2, wherein expression of a 2xC2H2 zinc finger gene may be 
35 increased or further increased by decreasing expression of LOS2 in plants. 
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Also encompassed by an indirect approach for modifying expression of a 2xC2H2 zinc finger 
gene and/or for modifying level and/or activity of a 2xC2H2 zinc finger protein is the provision 
of, or the inhibition or stimulation of regulatory sequences that drive expression of a native 
2xC2H2 zinc finger gene or transgene. Such regulatory sequences may be introduced into a 
5 plant. For example, the regulatory sequence to be introduced into a plant may be a promoter 
capable of driving expression of an endogenous 2xC2H2 zinc finger gene. 

A further indirect approach for modifying expression of a 2xC2H2 zinc finger gene and/or for 
modifying level and/or activity of a 2xC2H2 zinc finger protein in a plant encompasses 

10 modifying levels in a plant of a factor capable of interacting with a zinc finger protein. Such 
factors may include ligands of a 2xC2H2 zinc finger protein. Therefore, the present invention 
also provides a method for modifying growth characteristics of a plant, comprising modifying 
expression of a gene coding for a protein which is a natural ligand of a 2xC2H2 zinc finger 
protein. Furthermore, the present invention also provides a method for modifying growth 

15 characteristics of a plant, comprising modifying expression of a gene coding for a protein which 
is a natural target/substrate of a 2xC2H2 zinc finger protein. Examples of such 
targets/substrates include stretches of DNA that are bound by the zinc-finger domains. 

A direct and preferred approach on the other hand comprises introducing into a plant a nucleic 
20 acid encoding a 2xC2H2 zinc finger protein or a portion thereof or sequences capable of 
hybridising therewith, which nucleic acid preferably encodes a 2xC2H2 zinc finger protein or a 
homologue, derivative or active fragment thereof. The nucleic acid may be introduced into a 
plant by, for example, transformation. 

25 Therefore, there is provided a method for modifying growth characteristics of a plant, 
comprising introducing into a plant a 2xC2H2 zinc finger nucleic acid or a portion thereof. 

The 2xC2H2 zinc finger nucleic acid may be derived (either directly or indirectly (if 
subsequently modified)) from any source provided that the sequence, when expressed in a 

30 plant, leads to modified expression of a 2xC2H2 zinc finger-encoding nucleic acid/gene and/or 
modified level and/or activity of a 2xC2H2 zinc finger protein. The 2xC2H2 zinc finger gene or 
protein may be wild type, i.e. the native or endogenous nucleic acid or polypeptide. 
Alternatively, it may be a protein or nucleic acid derived from the same or another species. The 
nucleic acid/gene may then be introduced into a plant as a transgene, for example by 

35 transformation. 
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The nucleic acid may be isolated from a bacteria, yeast or fungi, or from a plant, algae, insect 
or animal (including human) source. This nucleic acid may be substantially modified from its 
native form in composition and/or genomic environment through deliberate human 
manipulation. The nucleic acid is preferably obtained from a plant, whether from the same 
5 plant species in which it is to be introduced or whether from a different plant species. Further 
preferably, the nucleic acid is from a dicot, preferably from the family Brassicaceae, farther 
preferably from Arabidopsis thaliana. More preferably, the nucleic acid is essentially similar to 
a nucleic acid as represented by SEQ ID NO 1, or a portion of SEQ ID NO 1, or a nucleic acid 
capable of hybridising therewith or is a nucleic acid encoding an amino acid sequence 
10 essentially similar to an amino acid as represented by SEQ ID NO 2, or a homofogue, 
derivative or active fragment thereof. 

Advantageously, the methods according to the invention may also be practised using variant 
2xC2H2 zinc finger nucleic acids and variant 2xC2H2 zinc finger amino acids, preferably 
wherein the variant nucleic acids are variants of SEQ ID NO 1 and wherein the variant amino 
acids are variants of SEQ ID NO 2. Examples of variant sequences suitable in performing the 
methods of the invention include: 

(i) Functional portions of a 2xC2H2 zinc finger nucleic acid/gene; 

(ii) Sequences capable of hybridising with a 2xC2H2 zinc finger nucleic acid/gene; 

(iii) Alternative splice variants of a 2xC2H2 zinc finger nucleic acid/gene; 

(iv) Allelic variants of a 2xC2H2 zinc finger nucleic acid/gene; 

(v) Homologues, derivatives and active fragments of a 2xC2H2 zinc finger protein. 

The abovementioned variants may also be described as being "essentially similar" to a 
2xC2H2 zinc finger nucleic acid/gene, particularly to the 2xC2H2 zinc finger encoding nucleic 
acid of SED ID NO 1, or essentially similar to a 2xC2H2 zinc finger amino acid/protein, 
particularly that of SED ID NO 2. The term "essentially similar to" also includes variants of 
SEQ ID NO 1 in the form of a complement, DNA, RNA, cDNA or genomic DNA. The variant 
nucleic acid encoding a 2xC2H2 zinc finger protein or the variant of a 2xC2H2 zinc finger 
protein may be synthesized in whole or in part, it may be a double-stranded nucleic acid or a 
single-stranded nucleic acid. Also, the term encompasses a variant due to the degeneracy of 
the genetic code; a family member of the gene or protein; and variants that are interrupted by 
one or more intervening sequences. 

35 An example of a variant 2xC2H2 zinc finger nucleic acid is a functional portion of a 2xC2H2 
zinc-finger gene. Advantageously, the method according to the present invention may also be 
practised using portions of a DNA or nucleic acid encoding a 2xC2H2 zinc finger protein. A 
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functional portion refers to a piece of DNA derived or prepared from an original (larger) DNA 
molecule, which DNA portion, when expressed in a plant, gives plants having modified growth 
characteristics. The portion may comprise many genes, with or without additional control 
elements or may contain spacer sequences. The portion may be made by making one or 
5 more deletions and/or truncations to the nucleic acid. Techniques for introducing truncations 
and deletions into a nucleic acid are well known in the art. Portions suitable for use in the 
methods according to the invention may readily be determined by following the methods 
described in the Examples section by simply substituting the sequence used in the actual 
Example with the portion to be tested for functionality. 

10 

An example of a further variant 2xC2H2 zinc finger nucleic acid is a sequence that is capable 
of hybridising to a 2xC2H2 zinc finger nucleic acid, for example to any of SEQ ID NO 1, 10, 12, 
14, 16, 18, 20, 22, 24, 26, 28, 30, 32, 34, 36, 38, 40, 41, 43, 45, 47 or 49. Advantageously, the 
methods according to the present invention may also be practised using these variants. 
15 Hybridising sequences suitable for use in the methods according to the invention may readily 
be determined for example by following the methods described in the Examples section by 
simply substituting the sequence used in the actual Example with the hybridising sequence. 

The term "hybridisation" as defined herein is a process wherein substantially homologous 

20 complementary nucleotide sequences anneal to each other. The hybridisation process can 
occur entirely in solution, i.e. both complementary nucleic acids are in solution. Tools in 
molecular biology relying on such a process include the polymerase chain reaction (PCR; and 
all methods based thereon), subtractive hybridisation, random primer extension, nuclease S1 
mapping, primer extension, reverse transcription, cDNA synthesis, differential display of RNAs, 

25 and DNA sequence determination. The hybridisation process can also occur with one of the 
complementary nucleic acids immobilised to a matrix such as magnetic beads, Sepharose 
beads or any other resin. Tools in molecular biology relying on such a process include the 
isolation of poly (A+) mRNA. The hybridisation process can furthermore occur with one of the 
complementary nucleic acids immobilised to a solid support such as a nitro-cellulose or nylon 

30 membrane or immobilised by e.g. photolithography to, for example, a siliceous glass support 
(the latter known as nucleic acid arrays or microarrays or as nucleic acid chips). Tools in 
molecular biology relying on such a process include RNA and DNA gel blot analysis, colony 
hybridisation, plaque hybridisation, in situ hybridisation and microarray hybridisation. In order 
to allow hybridisation to occur, the nucleic acid molecules are generally thermally or chemically 

35 denatured to melt a double strand into two single strands and/or to remove hairpins or other 
secondary structures from single stranded nucleic acids. The stringency of hybridisation is 
influenced by conditions such as temperature, salt concentration and hybridisation buffer 
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composition. High stringency conditions for hybridisation include high temperature and/or low 
salt concentration (salts include NaCI and Na 3 -citrate) and/or the inclusion of formamide in the 
hybridisation buffer and/or lowering the concentration of compounds such as SDS (detergent) 
in the hybridisation buffer and/or exclusion of compounds such as dextran sulphate or 
5 polyethylene glycol (promoting molecular crowding) from the hybridisation buffer. Conventional 
hybridisation conditions are described in, for example, Sambrook (2001) Molecular Cloning: a 
laboratory manual, 3rd Edition Cold Spring Harbor Laboratory Press, CSH, New York, but the 
skilled craftsman will appreciate that numerous different hybridisation conditions may be 
designed in function of the known or the expected homology and/or length of the nucleic acid 

10 sequence. Sufficiently low stringency hybridisation conditions are particularly preferred (at 
least in the first instance) to isolate nucleic acids heterologous to the DNA sequences of the 
invention defined supra. An example of low stringency conditions is 4-6x SSC / 0.1-0.5% w/v 
SDS at 37-45°C for 2-3 hours. Depending on the source and concentration of the nucleic acid 
involved in the hybridisation, alternative conditions of stringency may be employed, such as 

15 medium stringency conditions. Examples of medium stringency conditions include 1-4x SSC / 
0.25% w/v SDS at £ 45°C for 2-3 hours. An example of high stringency conditions includes 0.1 
to 2x SSC / 0.1% w/v SDS at 60°C for 1-3 hours. The skilled man will be aware of various 
parameters which may be altered during hybridisation and washing and which will either 
maintain or change the stringency conditions. The stringency conditions may start low and be 

20 progressively increased until there is provided a hybridising nucleic acid, as defined 
hereinabove. Elements contributing to heterology include allelism, degeneration of the genetic 
code and differences in preferred codon usage. 

Another variant 2xC2H2 zinc finger nucleic acid useful in practising the methods according to 
25 the present invention is an alternative splice variant of a nucleic acid sequence encoding a 
2xC2H2 zinc finger protein. The term "alternative splice variant" as used herein encompasses 
variants of a nucleic acid sequence in which selected introns and/or exons have been excised, 
replaced or added. Such splice variants may be found in nature or may be manmade. Methods 
for making such splice variants are well known in the art. Splice variants suitable for use in the 
30 methods according to the invention may readily be determined for example by following the 
methods described in the Examples section by simply substituting the sequence used in the 
actual Example with the splice variant. 

Another variant 2xC2H2 zinc finger nucleic acid useful in practising the methods according to 
35 the present invention is an allelic variant of a nucleic acid encoding a 2xC2H2 zinc finger 
protein. Allelic variants exist in nature and encompassed within the methods of the present 
invention is the use of these natural alleles. Allelic variants also encompass Single Nucleotide 
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Polymorphisms (SNPs), as well as Small Insertion/Deletion Polymorphisms (INDELs). The size 
of INDELs is usually less than 100 bp. SNPs and INDELs form the largest set of sequence 
variants in naturally occurring polymorphic strains of most organisms. Allelic variants suitable 
for use in the methods according to the invention may readily be determined for example by 
5 following the methods described in the Examples section by simply substituting the sequence 
used in the actual Example with the allelic variant. 

The present invention provides a method for modifying plant growth characteristics, comprising 
modifying expression in a plant of an alternative splice variant or expression in a plant of an 
10 allelic variant of a nucleic acid encoding a 2xC2H2 zinc finger protein and/or by modifying level 
and/or activity in a plant of a 2xC2H2 zinc finger protein encoded by the alternative splice 
variant or allelic variant. 

Examples of variant 2xC2H2 zinc finger proteins useful in practicing the methods of the 
15 present invention are homologues, derivatives or functional fragments of a 2xC2H2 zinc finger 
protein. 

"Homologues" of a 2xC2H2 zinc finger protein encompass peptides, oligopeptides, 
polypeptides, proteins and enzymes having amino acid substitutions, deletions and/or 

20 insertions relative to the unmodified protein in question and having similar biological and 
functional activity as the unmodified protein from which they are derived. To produce such 
homologues, amino acids of the protein may be replaced by other amino acids having similar 
properties (such as similar hydrophobicity, hydrophilicity, antigenicity, propensity to form or 
break cc-helical structures or p-sheet structures). Conservative substitution tables are well 

25 known in the art (see for example Creighton (1984) Proteins. W.H. Freeman and Company). 
The homologues useful in the method according to the invention have at least in increasing 
order of preference 30%, 31%, 32%, 33%, 34%, 35%, 36%, 37%, 38%, 39%, 40%, 41%, 42%, 
43%, 44%, 45%, 46%, 47%, 48%, 49%, 50%, 52%, 54%, 56%, 58%, 60%, 62%, 64%, 66%, 
68%, 70%, 72%, 74%, 76%, 78%, 80%, 82%, 84%, 86%, 88%, 90%, 92%, 94%, 96%, 98% 

30 sequence identity or similarity to an unmodified protein. 

The percentage of identity may be calculated by using an alignment program well known in the 
art. For example, the percentage of identity may be calculated using the program GAP, or 
needle (EMBOSS package) or stretcher (EMBOSS package) or the program align X, as a 
35 module of the vector NTI suite 5.5 software package, using the standard parameters (for 
example GAP penalty 5, GAP opening penalty 15, GAP extension penalty 6.6). 
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According to another embodiment of the present invention, the nucleic acid sequence useful in 
the methods of the present invention is a nucleic acid encoding a protein homologous to SEQ 
ID NO 2. 

Methods for the search and identification of 2xC2H2 zinc finger protein homologues, for 
5 example STZ zinc finger homologues, would be well within the realm of a person skilled in the 
art. Such methods, involve screening sequence databases with the sequences provided by the 
present invention, for example SEQ ID NO 2 (or SEQ ID NO 1), preferably in a computer 
readable format. This sequence information may be available in public databases, that include 
but are not limited to Genbank ( http://www.ncbLnlm>nih,qov/web/Genbank ) < the European 

10 Molecular Biology Laboratory Nucleic acid Database (EMBL) (http:/w.ebi.ac.uk/ebi-docs/embI- 
db.html) or versions thereof or the MIPS database (http://mips.gsf.de/). Different search 
algorithms and software for the alignment and comparison of sequences are well known in the 
art. Such methods include GAP, BESTFIT, BLAST, FASTA and TFASTA. GAP uses the 
algorithm of Needleman and Wunsch (J. Mol. Biol. 48: 443-453, 1970) to find the alignment of 

15 two complete sequences that maximises the number of matches and minimises the number of 
gaps. The BLAST algorithm calculates percent sequence identity and performs a statistical 
analysis of the similarity between the two sequences. The suite of programs referred to as 
BLAST programs has 5 different implementations: three designed for nucleotide sequence 
queries (BLASTN, BLASTX, and TBLASTX) and two designed for protein sequence queries 

20 - (BLASTP and TBLASTN) (Coulson, Trends in Biotechnology: 76-80, 1994; Birren et al., 
GenomeAnalysis, 1: 543, 1997). The software for performing BLAST analysis is publicly 
available through the National Centre for Biotechnology Information. 

Default blast parameters to find useful homologues of any of SEQ ID NO 1, SEQ ID NO 2 or 
25 any of SEQ ID NO 10 to SEQ ID NO 50, are, when comparing nucleotide sequence G (Cost to 
open a gap) 5, E (Cost to extend a gap default) 2, q (Penalty for a mismatch) -3, r (Reward for 
a match) 1, e (Expectation value (E)) 10.0, W (Word size) 11, V (Number of one-line 
descriptions) 100 and B (Number of alignments to show) 100. When comparing protein 
sequences, the default parameters are preferably G 11, E 1, e value 10.0, W 3, V 100 and B 
30 100. 

The above-mentioned analyses for comparing sequences, for the calculation of sequence 
identity and for the search for homologues, is preferentially done with full-length sequences or 
within a conserved region of the sequence. Therefore, these analyses may be based on a 
35 comparison of certain regions such as conserved domains, motifs or boxes. 
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The identification of such domains or motifs for examples the motif and boxes as represented 
by SEQ ID NO 5, 6, 7, 8 and 9, would also be well within the realm of a person skilled in the art 
and involves for example, a computer readable format of proteins of the present invention, the 
use of alignment software programs and the use of publicly available information on protein 
5 domains, conserved motifs and boxes. This protein domain information is available in the 
PRODOM (http://www.biochem.ucl.ac.uk/^ PIR 
(http://pir.georgetown.edu/) or pFAM (http://pfam.wustl.edu/) database. For the identification of 
Zinc finger domains, such as the 2xC2H2 zinc finger domain, pFAM is preferred. Sequence 
analysis programs designed for motif searching may be used for identification of fragments, 

10 regions and conserved domains as mentioned above. Preferred computer programs would 
include but are not limited to MEME, SIGNALSCAN, and GENESCAN. A MEME algorithm 
(Version 3.0) may be found in the GCG package; or on the Internet site 
http://www.sdsc.edu/MEME/meme. SIGNALSCAN version 4.0 information is available on the 
Internet site http://biosci.cbs.umn.edu/software/sigscan.html. GENESCAN may be found on 

15 the Internet site http://gnomic.stanford.edu/GENESCANW.html. 

At present, zinc finger motifs are subdivided in more than 40 different classes as can be found 
in the Pfam database of protein families present at the Sanger institute 
(http://www.sanger.ac.Uk/Software/Pfam/browse/Z.shtml). 

20 

The C2H2 zinc finger (Zf-C2H2) motif is the classical zinc finger domain. It was first recognized 
in the transcription factor IIIA (TFIIIA) of Xenopus (Miller at al. 1985). The domain is typically 
25 to 30 amino-acid residues in length. The following pattern describes the zinc finger *X-C- 
X(1-5)-C-X3-*X5-*X2-H-X(3-6)-[H/C] where X can be any amino acid, and numbers in brackets 

25 indicate the number of residues. The positions marked * are those that are important for the 
stable folding of the zinc finger. The final position can be either his or cys, while still being a 
C2H2 zinc finger domain. In view of recent publications on the design of zinc finger domains it 
becomes feasible also to replace one or more of the Cys or His amino acids, whilst still 
retaining the original functionality of the C2H2 domain. The residues separating the second 

30 Cys and the first His are mainly polar and basic. The canonical C2H2 zinc finger is composed 
of two short beta strands followed by an alpha helix. DNA binding of the zinc finger motif is 
mediated by amino terminal part of the alpha helix which binds the major groove in DNA 
binding zinc fingers. C2H2 domains have been shown to interact with RNA, DNA and proteins. 
The tetracoordination of a Zinc ion by the conserved cystein and histidine residues determines 

35 the conserved tertiary structure of the motif. Conserved hydrophobic residues are commonly 
found at positions -2 and also at 4 amino acids after the second cystein (that participates in 
zinc binding) and at position three before the first histidine (that participates in zinc binding). In 
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plant multi zinc finger proteins, spacing between the C2H2 domains is generally about 15 to 
about 65 amino acids. 

Thus, plant zinc finger proteins are characterized by long spacers of diverse lengths between 
5 adjacent fingers. Moreover, they are characterised by a highly conserved sequence of six 
amino acids, located within a putative DNA-contacting surface of each finger. Two forms of 
such conserved sequence are most commonly found in plant C2H2 zinc fingers, the QALGGH 
(SEQ ID NO 5) and the NNM/WQMH (SEQ ID NO 6). Despite the high sequence conservation 
of the QALGGH, some variants or the so-called 'modified type' occur in nature where one or 

10 two amino acids can have a different form, most typically the +1 can be a "GY K" or 

(these amino acids share the same turn-like characteristic), the +2 "A" can be "S" (both of 
which share the characteristic of being small amino acids) or the +3" "L can be T" (these two 
amino acids are both hydrophobic). The QALGGH-motif as used herein comprises all these 
variants. In the NNM/WQMH motif at position 3 there is mostly an "M" or a "W. 

15 Therefore, the present invention provides a method as described hereinabove, wherein said 
2xC2H2 zinc finger protein comprises a QALGGH motif. Further, The present invention 
provides a s described hereinabove, wherein said 2xC2H2 zinc finger protein comprises a 
NNM/WQMH motif. 

20 According to one embodiment of the invention, both C2H2 domains are of the same type. More 
preferably, both C2H2 zinc finger domains have the same conserved GALGGH or 
NNM/WQMH motif. According to another embodiment, each C2H2 zinc finger domain has a 
different conserved motif. 

25 According to one embodiment, the 2xC2H2 protein useful in the methods of the present 
invention is characterized by an EAR motif, which is an ERF-Associated amphiphilic 
repression motif. This motif has been recognized in two unrelated types of transcription factors, 
namely the ERF transcription factors of the AP2 type and in the zinc finger transcription 
factors. In the latter class, the EAR motif is generally located at the C-terminus of the protein. 

30 The pattern for the EAR motif has the conserved sequence hDLNh(X)P (SEQ ID NO 7), where 
"h" is a hydrophobic residue (any one of A > C,F I G,H,I,K > L J M,R I T 1 V,W,Y) most typically L/F/l and 
where "X" can be one (any amino acid) or no amino acid. A characteristic feature of the EAR 
motif is the alternation of hydrophilic and hydrophobic residues with the aspartic acid (D) 
residue being amphiphilic. Ohta et al. (The plant cell, 2001, 13, p1 959-1 968), which reference 

35 is cited herein by reference, previously characterized EAR motifs present in 2xC2H2 zinc 
finger proteins. 
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Therefore, the present invention provides a method as described hereinabove, wherein the 
2xC2H2 zinc finger protein comprises an EAR motif. According to one embodiment, the EAR 
motif is located in the C-terminal region of the protein, preferably between the second zinc 
finger domain and the C-terminus. 

5 

According to a further embodiment, the zinc finger proteins used in the methods of the present 
invention have two zinc finger domains and a nuclear localization signal (B-box). A cluster of 
basic amino acids that resembles the B-box (Basic box) were described by Chua et al. (EMBO 
1992- 11, 241 -9) and were hypothesized to be a nuclear localization signal for the protein. 

10 These have been recognized in 2xC2H2 proteins (Sakamoto et al., Gene 248 (2000) 23-32). 
The cluster is rich in Lysine (K) and Arginine (R) residues. A consensus sequence defining the 
most frequent form of the B-box in 2xC2H2 genes is KR(S)KRXR (SEQ ID NO 8) where "S" at 
the 3rd position may be absent or present. However other variants may occur in nature that 
still retain the characteristic of being a charged region rich in basic amino acids. The location of 

15 the basic box is most frequently at the N-terminus of the protein, but can also occur in other 
locations. It has been speculated that due to its basic nature the B-box could also participate in 
DNA binding. 

Accordingly, the present invention provides a method as described hereinabove, wherein the 
20 2xC2H2 zinc finger protein further comprises a B-box. According to one embodiment the B-box 
is located in the N-terminal region of the zinc finger protein. Preferably the proteins useful in 
the methods of the present invention have a B-box located between the N-terminus and the 
first zinc finger domain, 

25 According to a further embodiment, the zinc finger proteins useful in the methods of the 
present invention have two C2H2 zinc finger domains and an L-box. A conserved motif, named 
L-box, of yet unknown function has been identified in 2xC2H2 proteins and has been described 
previously by Sakamoto et al. (Gene 248 (2000) 23-32). The L-box is typically located at the N- 
terminus, between the B-box and the first C2H2 zinc finger. The L-box is represented by the 

30 sequence EXEXXAXCLXXL (SEQ ID NO 9). This region may be involved in protein-protein 
interactions. Zinc finger proteins lacking the L-box, may for example have serine rich regions at 
a similar position, which regions are putative sites for protein-protein interactions. 

Therefore, the present invention provides a method as described hereinabove, wherein the 
35 2xC2H2 protein comprises an L-box. 
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Particular zinc finger homologues useful in the methods of the present invention have one or 
more of the conserved motifs as depicted in SEQ ID NO 5, 6, 7, 8 and 9, or motifs that are 
80% identical to these motifs or motifs that have conserved substitutions of amino acids. The 
2xC2H2 protein as set forth in SEQ ID NO 2 comprises all the boxes as set forth in SEQ ID NO 
5 5, 7, 8 and 9. All its paralogues and orthologues also comprise all of these boxes. 

Homologues of a 2xC2H2 protein as presented in SEQ ID NO 2 and isolated from Arabidopsis 
thaliana, that are useful in the constructs and the methods of the present invention are also 
identified in other plant species. 

10 

Two special forms of homologue, orthologues and paralogues, are evolutionary concepts used 
to describe ancestral relationships of genes. The term "paralogue" relates to a gene- 
duplication within the genome of a species leading to paralogous genes. The term "orthologue" 
relates to a homologous gene in different organisms due to ancestral relationship. The term 
15 "homologue" as used herein also encompasses paralogues and orthologues of the proteins 
useful in the methods according to the invention. 

Othologues in other plant species may easily be found by performing a so-called reciprocal 
blast search. Orthologous genes can be identified by querying one or more gene databases 

20 with a query gene or protein of interest (SEQ ID NO 1 or 2), using for example BLAST 
program. The highest-ranking subject genes that result from the search are then again 
subjected to a BLAST analysis, and only those subject genes that match again with the query 
sequence (SEQ ID NO 1 or 2) are retained as true orthoJogous genes. For example, to find a 
rice orthologue of an Arabidopsis thaliana gene, one may perform a BLASTN or TBLASTX 

25 analysis on a rice database such as (but not limited to) the Oryza sativa N/pponbare database 
available at the NCBI website (http://www.ncbi.nlm.nih.gov) or the genomic sequences of rice 
(cultivars indica or japonica). In a next step, the obtained rice sequences are used in a reverse 
BLAST analysis using an Arabidopsis database. The results may be further refined when the 
resulting sequences are analysed with ClustalW and visualised in a neighbour joining tree. 

30 The method can be used to identify orthologues from many different species. 

The closest homologues in other species (orthologues of the protein of SEQ ID NO 2), include 
those from a variety of dicot and monocot plants, for example from Datisca glomerata 
(AF119050_1, AAD26942, SEQ ID NO 10 and 11), from soybean (T09602, SCOF-1, SEQ ID 
35 NO 12 and 13), Medicago sativa (CAB77055.1, SEQ ID NO 14 and 15), from tobacco 
(T01985, SEQ ID NO 16 and 17) from rice, (AF332876JI, AAK01713.1, SEQ ID NO 18 and 
19), from petunia (BAA05079.1, SEQ ID NO 20 and 21), from wheat (S39045 and BAA03901, 
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WZF1, SEQ ID NO 22 and 23), from Capsicum annum (SEQ ID NO 24 and 25), from turnip 
(T14408, T14409) and from sugarcane (CA279020). 

Close homologues of the same species ( paralogues of the protein of SEQ ID NO 2 from 
5 Arabidopsis thaliana) are described below. 

The MIPS database contains the sequence of the Arabidopsis thaliana genome with prediction 
and functional annotation of the proteins encoded. Searching this database with the STZ gene 
of SEQ ID NO 1 (MIPS accession number At1g27730), showed that in the Arabidopsis 
genome there are 2 genes encoding very close homologues of SEQ ID NO 2, At5g43170 

10 (NM_123683, SEQ ID NO 32 and 33) and At5g04340 (NM_120516 SEQ ID NO 28 and 29), 
and 3 others with high similarity: At3g19580 (NM_1 12848, SEQ ID NO 26 and 27), At5g67450 
(NM_126145, SEQ ID NO 34 and 35) and At3g49930 (NM_1 14853, SEQ ID NO 30-31). These 
genes are spread over 3 chromosomes, 1, 3 and 5. Similarly, a number of paralogues of the 
orthologue in Petunia have been isolated and sequenced. Advantageously, paralogues from 

15 the same species may be used in the methods of the present invention. 

Furthermore, a number of family members of the STZ protein of SEQ ID NO 2 have been 
found in Arabisopsis. The STZ gene and protein of SEQ ID NO 1 and 2 have been previously 
published in the database under the MIPS accession number At1g27730 or in Genbank under 

20 the accession numbers NP_1 74094.1, X95573 or CAA64820. Additionally, several other 
cDNA's, isolated from other tissues or at different developmental stages of Arabisopsis have 
been reported and encode the same protein as that of SEQ ID NO 2. S uch sequences 
sequences deposited under the Genbank accession number AY034998, NM_1 02538, 
AC12375, X95573, AY063006, X98671, X98670, or AF250336. These isolates illustrate the 

25 differential expression of the STZ gene in different plant tissues at different developmental 
stages. The differential regulation of these different cDNA's is reflected by the differences at 
their 5'UTR and the 3'UTR regions, while the encoded protein remains the same. 
Advantageously, the members of the same gene family as SEQ ID NO 1 or the members of 
the same family of any of the orthologues of SEQ ID NO 1 , may be used in the methods of the 

30 present invention. 

Other close homologues useful in the methods of the present invention are the sequences as 
deposited in the public database under the following accession numbers, which sequences are 
herein incorporated by reference: homologues isolated from Petunia: BAA21923.1, 
35 BAA21922.1, BAA21926.1, BAA21925.1, BAA19110.1, BAA19926.1, BAA21924.1, 
BAA19111.1, BAA21921.1, BAA19114.1, BAA05076.1, BAA05079.1, CAA43111.1, 
BAA21920.1, BAA21919.1, BAA05077.1, BAA05078.1, BAA20137.1; homologues isolated 
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from Arabidopsis: CAA67229.1, BAC43454.1, NP_196054.1, AAM67193.1, NPJI99131.1, 
NP_188592.1 , NP_201 546.1 , NP_1 90562.1 , NP_182037.1 , BAC43008.1 , Q8VWG3, 
CAC86393. 1 , CAC86168.1 , CAC861 67.1 , CAC861 66.1 , CAB67667.1 , CAC01747.1 , 
CAB90936.1, CAB90935.1, CAB80245.1, CAB41188.1, GAA18741.1, CAA67234.1, 
5 CAA67236.1, CAA67231.1, CAA67230.1, CAA67228.1, CAA67235.1, CAA67233.1, 
CAA67232.1, CAA67229.1, CAA64820.1 and homologues isolated from rice: BAB 16855.1, 
AAO06972.1 , CAC09475.1 , BAB63718.1 , P0683F02.21 , BAB67885. 1 , P0031 D1 1 .1 9, 
BAB64114.1, AAK01713.1, AF332876_1, AAL76091.1, BAB67879.1, P0031D11.12 and 
BAC15513.1. 

10 

A phylogenetic tree may be constructed with all the homologues, paralogues and orthologues 
are defined herein above. Multiple alignment q may be made using ciustal W present in the 
VIMTi (version 5.0) program with for example Gap opening penalty 10 and Gap extention 5. For 
making a phylogenetic tree the Phylic software package available at 
15 httD://evolution.genetics.washington.edu/phvrip.html may be used. Sequences clustering 
around SEQ ID NO 1 or SEQ ID NO 2, identify genes or proteins suitable for use in the 
methods of the present invention. 

The sequence of SEQ ID NO 2 and its rice orthologue AF332876 (SEQ ID NO 19) have 36% 
20 sequence identity when using the program Needle with the parameters Gap penalty 5 and Gap 
extension penalty 6. Therefore, homologues particularly useful in the methods of the present 
invention are homologues having 36% or more sequence identity with the 2xC2H2 zinc finger 
protein as presented in SEQ ID NO 2 or having 36% or more sequence identity to the closest 
orthologue of SEQ ID NO 2 from another species. 

25 

Preferred homologues useful in practicing the methods of the present invention are plant 
homologues, i.e. proteins obtained from a plant nucleic acid. More preferably, the nucleic acid 
sequence is from a dicot, more preferably from the family Brassicaceae, further preferably from 
Arabidopsis thafiana. 

30 

Preferably the 2xC2H2 zinc finger protein useful in the methods of the present invention 
belongs to the same gene family as the salt tolerant zinc finger protein (STZ) of Arabidopsis 
thaliana, or is a homologues thereof. The name ZAT10 can also be used to identify the STZ 
zinc finger protein of Arabidopsis thafiana. 

35 

Another variant of a zinc finger protein useful in the methods of the present invention is a 
substitutional variant. The term "Substitutional variants" of a protein refers to those variants in 
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which at least one residue in an amino acid sequence has been removed and a different 
residue inserted in its place. Amino acid substitutions are typically of single residues, but may 
be clustered depending upon ftjnctional constraints placed upon the polypeptide; insertions will 
usually be of the order of about 1-10 amino acid residues, and deletions will range from about 
5 1-20 residues. Preferably, amino acid substitutions comprise conservative amino acid 
substitutions. Particular substitutional variants of the C2H2 zinc finger protein are substitutional 
variants in which one or more of the conserved Cys and/or His residues is replaced, whilst 
retaining the same zinc finger functionality. To retain the same functionality, the residues 
around these conserved Cys of His residues may also be substituted. 

10 

"Insertional variants" of a protein are those in which one or more amino acid residues are 
introduced into a predetermined site in said protein. Insertions can comprise amino-terminal 
and/or carboxy-terminal fusions as well as intra-sequence insertions of single or multiple amino 
acids. Generally, insertions within the amino acid sequence will be smaller than amino- or 

15 carboxy-terminal fusions, of the order of about 1 to 10 residues. Examples of amino- or 
carboxy-terminal fusion proteins or peptides include the binding domain or activation domain of 
a transcriptional activator as used in the yeast two-hybrid system, phage coat proteins, 
(histidine) 6 -tag, glutathione S-transferase-tag, protein A, maltose-binding protein, dihydrofolate 
reductase, Tag«100 epitope, c-myc epitope, FLAG®-epitope, lacZ, CMP (calmodulin-binding 

20 peptide), HA epitope, protein C epitope and VSV epitope. 

"Deletion variants" of a protein are characterised by the removal of one or more amino acids 
from the protein. Amino acid variants of a protein may readily be made using peptide synthetic 
techniques well known in the art, such as solid phase peptide synthesis and the like, or by 

25 recombinant DNA manipulations. Methods for the manipulation of DNA sequences to produce 
substitution, insertion or deletion variants of a protein are well known in the art. For example, 
techniques for making substitution mutations at predetermined sites in DNA are well known to 
those skilled in the art and include M13 mutagenesis, T7-Gen in vitro mutagenesis (USB, 
Cleveland, OH), QuickChange Site Directed mutagenesis (Stratagene, San Diego, CA), PCR- 

30 mediated site-directed mutagenesis or other site-directed mutagenesis protocols. 

The term "derivatives" refers to peptides, oligopeptides, polypeptides, proteins and enzymes 
which may comprise substitutions, deletions or additions of naturally and non-naturally 
occurring amino acid residues compared to the amino acid sequence of a naturally-occurring 
35 form of the 2xC2H2 protein such as for example the 2xC2H2 zinc finger protein as presented 
in SEQ ID NO 2. "Derivatives" of a 2xC2H2 zinc finger protein encompass peptides, 
oligopeptides, polypeptides, proteins and enzymes which may comprise naturally occurring 
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altered, glycosylated, acylated or non-naturally occurring amino acid residues compared to the 
amino acid sequence of a naturally-occurring form of the polypeptide. A derivative may also 
comprise one or more non -amino acid substituents compared to the amino acid sequence from 
which it is derived, for example a reporter molecule or other ligand, covalently or non- 
5 covalently bound to the amino acid sequence such as, for example, a reporter molecule which 
is bound to facilitate its detection, and non-naturally occurring amino acid residues relative to 
the amino acid sequence of a naturally-occurring protein. 

Another variant of a 2xC2H2 zinc finger protein useful in the methods of the present invention 
10 is an active fragment of a zinc finger protein. "Active fragments" of a 2xC2H2 zinc finger 
protein encompasses at least five contiguous amino acid residues of a protein, which residues 
retain similar biological and/or functional activity to the naturally occurring protein. For 
example, useful fragments comprise at least 10 contiguous amino acid residues of a 2xC2H2 
zinc finger protein. Other preferred fragments are fragments of a 2xC2H2 zinc finger protein 
15 starting at the second or third or further internal methionin residues. These fragments originate 
from protein translation, starting at internal ATG codons. Functional fragments of a 2xC2H2 
zinc finger protein useftjl in practising the methods of the present invention may have one, two 
or no C2H2 domains, without affecting its functionality in the methods of the present invention. 

20 According to a preferred feature of the present invention, enhanced or increased expression of 
a nucleic acid encoding a 2xC2H2 zinc finger protein is envisaged. Methods for obtaining 
enhanced or increased expression of genes or gene products are well documented in the art 
and include, for example, over-expression driven by a strong promoter, the use of transcription 
enhancers or translation enhancers. The term over-expression as used herein means any form 

25 of expression that is additional to the original wild-type expression level. Preferably the nucleic 
acid to be introduced into the plant and/or the nucleic acid that is to be overexpressed in the 
plant is in the sense direction with respect to the promoter to which it is operably linked. 
Preferably, the nucleic acid sequence represented by SED ID NO 1 is over-expressed in a 
plant. However, it should be clear that the applicability of the invention is not limited to use of 

30 the nucleic acid represented by SEQ ID NO 1 nor to the nucleic acid sequence encoding the 
amino acid sequence of SEQ ID NO 2, but that other nucleic acid sequences encoding 
homologues, derivatives or active fragments of SED ID NO 1 or SED ID NO 2 may be useful in 
the methods of the present invention. Examples of nucleic acids or proteins are provided in 
SEQ ID NO 10 to SEQ ID NO 50. 

35 

Alternatively and/or additionally, increased expression of a 2xC2H2 encoding gene or 
increased level and/or activity of a 2xC2H2 protein in a plant cell, is achieved by mutagenesis. 

18 



WO 2004/058980 



PCT/EP2003/051104 



For example these mutations may be responsible for altered control of the 2xC2H2 gene, 
resulting in more expression of the gene, relative to the wild-type gene. Mutations can also 
cause conformational changes in a protein, resulting in more activity and/or higher levels of the 
2xC2H2 protein. 

5 

Modifying gene expression (whether by a direct or indirect approach) encompasses altered 
transcript levels of a gene. Altered transcript levels may be sufficient to induce certain 
phenotypic effects, for example via the mechanism of cosuppression. Here the overall effect of 
introduction of a transgene is that there is less activity in the cell of the protein encoded by a 

10 native gene having homology to the introduced transgene. Therefore, according to another 
embodiment of the present invention, there is provided a method for modifying growth 
characteristics in a plant, comprising decreasing expression of a gene encoding a 2xC2H2 zinc 
finger protein or decreasing level and/or activity of a 2xC2H2 zinc finger protein. Examples of 
decreasing expression, level and/or activity of a protein in a cell are well documented in the art 

15 and include, for example, down regulation of expression by anti-sense techniques, RNAi 
techniques, small interference RNAs (siRNAs) and microRNA (miRNA) . 

Another method for downregulation of gene expression or gene silencing comprises use of 
ribozymes, for example as described in Atkins et al. 1994 (WO 94/00012), Lenee et al. 1995 
20 (WO 95/03404), Lutziger et al. 2000 (WO 00/00619), Prinsen et al. 1997 (WO 97/3865) and 
Scott et al. 1997 (WO 97/38116). 

Gene silencing may also be achieved by insertion mutagenesis (for example, T-DNA insertion 
or transposon insertion) or by gene silencing strategies as described by, among others, Angell 
25 and Baulcombe 1998 (WO 98/36083), Lowe et al. 1989 (WO 98/53083), Lederer et al. 1999 
(WO 99/15682) or Wang et al. 1999 (WO 99/53050). 

Expression of an endogenous gene may also be reduced if it contains a mutation. Such a 
mutation or such a mutant gene may be isolated and introduced into the same or different 
30 plant species in order to obtain plants having modified growth characteristics. Examples of 
such mutants are dominant negative mutants of a 2xC2H2 zinc finger gene. 

Genetic constructs aimed at silencing gene expression may comprise the 2xC2H2 zinc finger 
nucleic acid, for example as represented by SEQ ID NO 1 (or one or more portions thereof or a 
35 sequence capable of hybridising therewith), in a sense and/or antisense orientation relative to 
the promoter sequence. The sense or antisense copies of at least part of the endogenous 
gene in the form of direct or inverted repeats may also be utilised in the methods according to 
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the invention. The growth characteristics of plants may also be modified by introducing into a 
plant at least part of an antisense version of the nucleotide sequence represented by SEQ ID 
N01. 

5 According to a further embodiment of the present invention, genetic constructs and vectors to 
facilitate introduction and/or to facilitate expression of the 2xC2H2 zinc finger nucleotide 
sequences useful in the methods according to the invention are provided. Therefore, according 
to the present invention, there is provided a construct comprising: 

(i) a nucleic acid capable of modifying expression of a nucleic acid encoding a 
10 2xC2H2 zinc finger protein and/or modifying level and/or activity of a 2xC2H2 zinc finger 

protein; 

(ii) one or more control sequence capable of driving expression of the nucleic acid 
sequence of (i); and optionally 

(iii) a transcription termination sequence. 

15 

Constructs useful in the methods according to the present invention may be constructed using 
recombinant DNA technology well known to persons skilled in the art. The gene constructs 
may be inserted into vectors, which may be commercially available, suitable for transforming 
into plants and suitable for expression of the gene of interest in the transformed cells. 
20 Preferably the genetic construct is a plant expression vector. 

The nucleic acid according to (i) is advantageously any of the nucleic acids described 
hereinbefore. A preferred nucleic acid is the nucleic acid represented by SEQ ID NO 1 or a 
variant thereof as hereinbefore defined, or is a nucleic acid sequence encoding a sequence 
25 represented by SEQ ID NO 2 or a variant as hereinbefore defined. For example such variants 
encode a protein as presented in any of SEQ ID NO 1 1, 13, 15, 17, 19, 21, 23, 25, 27, 29, 31, 
33, 35, 37, 39, 42, 44, 46, 48 and 50. 

The terms "regulatory element" and "control sequence" are used herein interchangeably and 
30 are to be taken in a broad context to refer to regulatory nucleic acids capable of effecting 
expression of the sequences to which they are operably linked. Encompassed by the 
aforementioned terms are promoters. A "promoter" encompasses transcriptional regulatory 
sequences derived from a classical eukaryotic genomic gene (including the TATA box which is 
required for accurate transcription initiation, with or without a CCAAT box sequence) and 
35 additional regulatory elements (i.e. upstream activating sequences, enhancers and silencers) 
which alter gene expression in response to developmental and/or external stimuli, or in a 
tissue-specific manner. Also included within the term is a transcriptional regulatory sequence 
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of a classical prokaryotic gene, in which case it may include a --35 box sequence and/or -10 
box transcriptional regulatory sequences. The term "regulatory elemenf also encompasses a 
synthetic fusion molecule or derivative which confers, activates or enhances expression of a 
nucleic acid molecule in a cell, tissue or organ. The term "operably linked" as used herein 
5 refers to a functional linkage between the promoter sequence and the gene of interest, such 
that the promoter sequence is able to initiate transcription of the gene of interest. Preferably, 
the gene of interest is operably linked to a promoter in a sense direction. 

Advantageously, any type of promoter may be used to drive expression of the nucleic acid 
10 sequence depending on the desired outcome. 

Promoters useful for the present invention are described in EP 03075331.3, which promoters 
and sequences are incorporated herein by reference. 

Other examples of preferred promoters are presented in Table I (a) to (c), which promoters or 
15 derivatives thereof are useful in the methods and/or in making the constructs of the present 
invention. Accordingly, genetic constructs comprising of the nucleic acids of (i), for example a 
2xC2H2 nucleic acid, and at least part of a promoter from Table 1 (a) to (c) or from EP 
03075331.3, preferably, wherein said parts are operably linked, are also provided by the 
present invention. 

20 

According to onother embodiment, the nucleic acid of (i) is operably linked to a constitutive 
promoter. The term "constitutive" as defined herein refers to a promoter that is expressed 
substantially continuously. Furthermore, preferably the constitutive promoter is a ubiquitous 
promoter, which is expressed in more than one, preferably in most or substantially all tissues 

25 of the plant. Preferably, the constitutive promoter to be used in the methods of the present 
invention, or cloned in the genetic constructs of the present invention, is a plant promoter, 
preferably a constitutive promoter, such as a GOS2 promoter or a promoter with similar 
strength and/or similar expression pattern. Preferably plant promoters derived from a plant 
nucleic acid are used. Alternatively, promoters operable in plant, such as promoters derived 

30 from plant pathogens are used. 

According to another embodiment of the invention, the nucleic acid of (i) is operably linked to a 
plant promoter, preferably a tissue-preferred promoter. The term "tissue-preferred" as used 
herein refers to a promoter that is expressed predominantly in at least one tissue or organ. For 
35 example, the tissue-preferred promoter is a seed-preferred promoter, such as a pWS18 
(Joshee et al. Plant Cell Physiol. 1998 Jan;39(1):64-72. ) or a promoter of similar strength 
and/or similar expression pattern. 
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Promoters with similar strength and/or similar expression pattern may be found by coupling the 
promoter to a reporter gene and checking the function of the reporter gene in different tissues 
of a plant. One suitable reporter gene is beta-glucuronidase and the colorimetric GUS staining 
5 to visualize the beta-glucuronidase activity in a plant tissue is well known to a person skilled in 
the art. 

Table I (a): flower preferred promoters useful in the present invention. Sequences of these 
promoters are described in the cited reference, which sequences are herein incorporated by 
10 reference. 



Gene 


Expression 


Reference 


AtPRP4 


flowers 


http://salus.medium.edu/mmg/tierney/html 


chalene synthase (chsA) 


flowers 


Van der Meer, et al., Plant MoL Biol. 15, 95- 
109, 1990. 


LAT52 


anther 


Twel! etal Mol. Gen Genet. 217:240-245 
(1989) 


apetala-3 


flowers 





Table f (b): seed-preferred promoters useful in the present invention. Sequences of these 
15 promoters are described in the cited reference, which sequences are herein incorporated by 
reference. 



Gene 


Expression 


Reference 


seed-specific genes 


seed 


Simon, et at., Plant Mol. Biol. 5: 191, 1985; 
Scofield, et a/., J. Biol. Chern. 262: 12202, 
1987.; Baszczynski, etal., Plant MoL Biol. 14: 
633, 1990. 


Brazil Nut albumin 


seed 


Pearson, et al., Plant MoL Biol. 18: 235-245, 
1992. 


legumin 


seed 


Ellis, et al., Plant Mol. Biol. 10: 203-214, 
1988. 


glutelin (rice) 


seed 


Takaiwa, et al., Mol. Gen. Genet. 208: 15-22, 
1986; Takaiwa, et al., FEBS Letts. 221: 43- 
47, 1987. 


zein 


seed 


Maizke et al Plant Mol Biol, 14(3):323-32 
1990 


napA 


seed 


Stalberg, et al, Planta 700:515-519, 1996. 



22 



WO 2004/058980 



PCT/EP2003/051104 



wheat LMW and HMW glutenin-1 


endosperm 


Mol Gen Genet 216:81-90, 1989; NAR 
17:461-2, 1989 


wheat SPA 


seed 


Albani et al, Plant Cell, 9: 171-184, 1997 


wheat a, J3, 7-gliadins 


endosperm 


EMBO 3:1409-15,1984 


barley Itrl promoter 


endosperm 




barley B1, C, D, hordein 


endosperm 


Theor Appl Gen 98:1253-62, 1999; Plant J 
4:343-55, 1993; Mol Gen Genet 250:750-60, 
1996 


barley DOF 


endosperm 


Mena et al, The Plant Journal, 116(1): 53-62, 
1998 


blz2 


endosperm 


EP991 06056.7 


synthetic promoter 


endosperm 


Vicente-Carbajosa et a/., Plant J. 13: 629 - 
640, 1998. 


rice prolamin NRP33 


endosperm 


Wu et al r Plant Cell Physiology 39(8) 885- 
889, 1998 


rice a-globulin Glb-1 


endosperm 


Wu et al, Plant Cell Physiology 39(8) 885- 
889, 1998 


rice OSH1 


embryo 


Sato et a/, Proc. Natl. Acad. Scl. USA, 93: 
8117-8122, 1996 


rice a-globulin REB/OHP-1 


endosperm 


Nakase et al. Plant Mol. Biol. 33: 513-522, 
1997 


rice ADP-glucose PP 


endosperm 


Trans Res 6:157-68, 1997 


maize ESR gene family 


endosperm 


Plant J 12:235-46, 1997 


sorgum y-kafirin 


endosperm 


PMB 32:1029-35, 1996 


KNOX 


embryo 


Postma-Haarsma et a/, Plant Mol. Biol. 
39:257-71, 1999 


rice oleosin 


embryo and 
aleuron 


Wu etat f J. Biochem., 123:386, 1998 


sunflower oleosin 


seed (embryo 
and dry seed) 


Cummins, et a/., Plant Mol. Biol. 19: 873-876, 
1992 



Table I (c): constitutive promoters useful in the present invention. Sequences of these 
promoters are described in the cited reference, which sequences are herein incorporated by 
reference. 



Gene 


Expression 


Reference 


Actin 


constitutive 


McElroy etal, Plant Cell, 2: 163-171, 
1990 


CAMV 35S 


constitutive 


Odeil et al, Nature, 313: 810-812, 
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1985 


CaMV19S 


constitutive 


Nilsson et al, Physiol. Plant 
fOO'456-462 1997 


GOS2 


constitutive 


de Pater et al, Plant J Nov;2(6):837- 
44 1992 


ubiquitin 


constitutive 


Christensen et al, Plant Mol. Biol. 
18: 675-689, 1992 


rice cyclophilin 


constitutive 


Buchholz etal, Plant Mol Biol. 25(5): 
837-43, 1994 


maize H3 histone 


constitutive 


Lepetit et al, Mol. Gen. Genet. 
231:276-285, 1992 


actin 2 


constitutive 


An et al, Plant J. 10(1); 107-121, 
1996 



Optionally, one or more terminator sequences may also be used in the construct introduced 
into a plant. The term "terminator* encompasses a control sequence which is a DNA sequence 
at the end of a transcriptional unit which signals 3' processing and polyadenylation of a primary 
5 transcript and termination of transcription. Additional regulatory elements may include 
transcriptional as well as translational enhancers. Those skilled in the art will be aware of 
terminator and enhancer sequences which may be suitable for use in the invention. Such 
sequences would be known or may readily be obtained by a person skilled in the art. 

10 The genetic constructs of the invention may further include an origin of replication sequence 
which is required for maintenance and/or replication in a specific cell type. One example is 
when a genetic construct is required to be maintained in a bacterial cell as an episomal genetic 
element (e.g. plasmid or cosmid molecule). Preferred origins of replication include, but are not 
limited to, the f1-ori and colE1 . 

15 

The genetic construct may optionally comprise a selectable marker gene. As used herein, the 
term "selectable marker gene" includes any gene which confers a phenotype on a cell in which 
it is expressed to facilitate the identification and/or selection of cells which are transfected or 
transformed with a genetic construct of the invention. Suitable markers may be selected from 
20 markers that confer antibiotic or herbicide resistance. Cells containing the recombinant DNA 
will thus be able to survive in the presence of antibiotic or herbicide concentrations that kill 
untransformed cells. Examples of selectable marker genes include genes conferring resistance 
to antibiotics (such as nptll encoding neomycin phosphotransferase capable of 
phosphorylating neomycin and kanamycin, or hpt encoding hygromycin phosphotransferase 
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capable of phosphorylating hygromycin), to herbicides (for example bar which provides 
resistance to Basta; aroA or gox providing resistance against glyphosate), or genes that 
provide a metabolic trait (such as manA that allows plants to use mannose as sole carbon 
source). Visual marker genes result in the formation of colour (for example beta-glucuronidase, 
5 GUS), luminescence (such as luciferase) or fluorescence (Green Fluorescent Protein, GFP, 
and derivatives thereof). Further examples of suitable selectable marker genes include the 
ampicillin resistance (Ampr), tetracycline resistance gene (Tcr), bacterial kanamycin resistance 
gene (Kanr), phosphinothricin resistance gene, and the chloramphenicol acetyltransferase 
(CAT) gene, amongst others 

10 

The present invention also encompasses plants obtainable by the methods according to the 
present invention. The present invention therefore provides plants obtainable by the method 
according to the present invention, which plants have modified growth characteristics, which 
plants have altered 2xC2H2 zinc finger protein level and/or activity and/or altered expression of 
15 a nucleic acid sequence encoding a 2xC2H2 zinc finger protein. 

Therefore, according to one aspect of the present invention, there is provided a method for the 
production of plants, having modified growth characteristics, comprising introducing, into a 
plant, a nucleic acid capable of modifying activity of a 2xC2H2 zinc finger protein and/or 
20 capable of modifying expression of a 2xC2H2 zinc-finger gene. According to a further 
embodiment of the present invention, there is provided a method for the production of 
transgenic plants having modified growth characteristics, comprising introduction and 
expression in a plant of a 2xC2H2 nucleic acid. 

25 More specifically, the present invention provides a method for the production of transgenic 
plants having modified growth characteristics, which method comprises: 
(i) introducing into a plant or plant cell a 2xC2H2 zinc finger nucleic acid; 

(iii) cultivating the plant cell under conditions promoting plant growth. 

30 The growth characteristic may be any of the characteristics defined hereinunder. 

The 2xC2H2 zinc finger nucleic acid includes all variant nucleic acids as described herein 
before and includes all nucleic acids encoding all variant proteins as described herein before. 
Cultivating the plant cell under conditions promoting plant growth, may or may not include 
35 regeneration and or growth to maturity. 
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The protein itself and/or the nucleic acid itself may be introduced directly into a plant cell or into 
the plant itself (including introduction into a tissue, organ or any other part of the plant). 
According to a preferred feature of the present invention, the nucleic acid is preferably 
introduced into a plant by transformation. . 

5 

The term "transformation" as referred to herein encompasses the transfer of an exogenous 
polynucleotide into a host cell, irrespective of the method used for transfer. Plant tissue 
capable of subsequent clonal propagation, whether by organogenesis or embryogenesis, may 
be transformed with a genetic construct of the present invention and a whole plant regenerated 

10 therefrom. The particular tissue chosen will vary depending on the clonal propagation systems 
available for, and best suited to, the particular species being transformed. Exemplary tissue 
targets include leaf disks, pollen, embryos, cotyledons, hypocotyls, megagametophytes, callus 
tissue, existing meristematic tissue (e.g., apical meristem, axillary buds, and root meristems), 
and induced meristem tissue (e.g., cotyledon meristem and hypocotyl meristem). The 

15 polynucleotide may be transiently or stably introduced into a host cell and may be maintained 
non-integrated, for example, as a plasmid. Alternatively, it may be integrated into the host 
genome. The resulting transformed plant cell can then be used to regenerate a transformed 
plant in a manner known to persons skilled in the art. 

20 Transformation of a plant species is now a fairly routine technique. Advantageously, any of 
several transformation methods may be used to introduce the nucleic acid of interest (e.g. the 
2xC2H2 nucleic acid) into a suitable ancestor cell. Transformation methods include the use of 
liposomes, electroporation, chemicals that increase free DNA uptake, injection of the DNA 
directly into the plant, particle gun bombardment, transformation using viruses or pollen and 

25 microprojectron. Methods may be selected from the calcium/polyethylene glycol method for 
protoplasts (Krens, F.A. et aL, 1882, Nature 296, 72-74; Negrutiu I. et al., June 1987, Plant 
Mol. Biol. 8, 363-373); electroporation of protoplasts (Shillito R.D. et al., 1985 Bio/Technol 3, 
1099-1102); microinjection into plant material (Crossway A. et al., 1986, Mol. Gen Genet 202, 
179-185); DNA or RNA-coated particle bombardment (Klein T.M. et al., 1987, Nature 327, 70) 

30 infection with (non-integrative) viruses and the like. A preferred transformation method is an 
Argobacterium mediated transformation method. 

Transgenic rice plants expressing a 2xC2H2 gene are preferably produced via Agrobacterium- 
mediated transformation using any of the well-known methods for rice transformation, such as 
35 the ones described in any of the following: published European patent application EP 1198985 
A1, Aldemita and Hodges (Planta, 199, 612-617, 1996); Chan et al. (Plant Mol. Biol. 22 (3) 
491-506, 1993); Hiei et al. (Plant J. 6 (2) 271-282, 1994); which disclosures are incorporated 
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by reference herein as if fully set forth. In the case of com transformation, the preferred 
method is as described in either Ishida et al. (Nat. Biotechnol. 1996 Jun; 14(6): 745-50) or 
Frame et al. (Plant Physiol. 2002 May; 129(1): 13-22), which disclosures are incorporated by 
reference herein as if fully set forth. 

5 

Generally after transformation, plant cells or cell groupings are selected for the presence of 
one or more markers which are encoded by plant-expressible genes co-transferred with the 
gene of interest, following which the transformed material is regenerated into a whole plant 

10 Following DNA transfer and regeneration, putatively transformed plants may be evaluated, for 
instance using Southern analysis, for the presence of the gene of interest, copy number and/or 
genomic organisation. Alternatively or additionally, expression levels of the newly introduced 
DNA may be undertaken using Northern and/or Western analysis, both techniques being well 
known to persons having ordinary skill in the art. 

15 

The generated transformed plants may be propagated by a variety of means, such as by clonal 
propagation or classical breeding techniques. For example, a first generation (or T1) 
transformed plant may be selfed to give homozygous second generation (or T2) transformants, 
and the T2 plants further propagated through classical breeding techniques. 

20 

The generated transformed organisms may take a variety of forms. For example, they may be 
chimeras of transformed cells and non-transformed cells; clonal transformants (e.g., all cells 
transformed to contain the expression cassette); grafts of transformed and untransformed 
tissues (e.g., in plants, a transformed rootstock grafted to an untransformed scion). 

25 

The present invention clearly extends to any plant cell or plant produced by any of the methods 
described herein, and to all plant parts and propagules thereof. The present invention extends 
further to encompass the progeny of a primary transformed or transfected cell, tissue, organ or 
whole plant that has been produced by any of the aforementioned methods, the only 

30 requirement being that progeny exhibit the same genotypic and/or phenotypic characteristic(s) 
as those produced in the parent by the methods according to the invention. The invention also 
includes host cells having modified expression and/or level and/or activity of a 2xC2H2 zinc 
finger protein . Such host cells for example comprise genetic constructs as mentioned above. 
Preferred host cells according to the invention are derived from a plant, algae, bacterium, 

35 fungus, yeast, insect or animal. The invention also extends to harvestable parts of a plant such 
as but not limited to seeds, leaves, fruits, flowers, petals, stamen, stem cultures, stem, 
rhizomes, roots, tubers, bulbs or cotton fibers. 
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The term "plant" as used herein encompasses whole plants, ancestors and progeny of the 
plants and plant parts, including seeds, shoots, stems, roots (including tubers), and plant cells, 
tissues and organs. The term "plant" also encompasses suspension cultures, embryos, 
5 meristematic regions, callus tissue, leaves, gametophytes, sporophytes, pollen, and 
microspores. Plants that are particularly useful in the methods of the invention include all 
plants which belong to the superfamily Viridiplantae, in particular monocotyledonous and 
dicotyledonous plants including, fodder or forage legumes, ornamental plants, food crop, tree, 
or shrub selected from the list comprising Acacia spp., Acerspp., Actinidia spp.,Aesculus spp., 

10 Agathis austrafis, Albizia amara, Atsophila tricolor, Andropogon spp., Arachis spp, Areca 
catechu, Astelia fragrans, Astragalus cicer, Baikiaea plurijuga, Betula spp., Brassica spp., 
Bruguiera gymnorrhiza, Burkea africana, Butea frondosa, Cadaba farinosa, Calliandra spp, 
Camellia sinensis, Canna indica, Capsicum spp., Cassia spp., Centroema pubescens, 
Chaenomeles spp., Cinnamomum cassia, Coffea arabica, Colophospermum mopane, 

15 Coronillia varia, Cotoneaster serotina, Crataegus spp., Cucumis spp., Cupressus spp., 
Cyathea dealbata, Cydonia oblonga, Cryptomeria japonica, Cymbopogon spp., Cynthea 
dealbata, Cydonia oblonga, Dalbergia monetaria, Davallia divaricata, Desmodium spp., 
Dicksonia squarosa, Diheteropogon amplectens, Dioclea spp, Dolichos spp., Dorycnium 
rectum, Echinochloa pyramidalis, Ehrartia spp., Eleusine coracana, Eragrestis spp., Erythrina 

20 spp., Eucalyptus spp., Euclea schimperi, Eulalia villosa, Fagopyrum spp., Feijoa sellowiana, 
Fragaria spp., Flemingia spp, Freycinetia banksii, Geranium thunbergii, Ginkgo biloba, Glycine 
javanica, Gliricidia spp, Gossypium hirsutum, Grevillea spp., Guibourtia coleosperma, 
Hedysarum spp., Hemarthia altissima, Heteropogon contortus, Hordeum vulgare, Hyparrhenia 
rufa, Hypericum erectum, Hyperthelia dissoluta, Indigo incarnata, Iris spp., Leptarrhena 

25 pyrolifolia, Lespediza spp., Lettuca spp., Leucaena leucocephala, Loudetia simplex, Lotonus 
bainesii, Lotus spp., Macrotyloma axillare, Malus spp., Manihot esculenta, Medicago sativa, 
Metasequoia glyptostroboides, Musa sapientum, Nicotianum spp., Onobrychis spp., 
Ornithopus spp., Oryza spp., Peltophorum africanum, Pennisetum spp., Persea gratissima, 
Petunia spp., Phaseolus spp., Phoenix canariensis, Phormium cookianum, Photinia spp., 

30 Picea glauca, Pinus spp., Pisum sativum, Podocarpus toiara, Pogonarthria fleckii, 
Pogonarthria squarrosa, Populus spp., Prosopis cineraria, Pseudotsuga menziesii, 
Pterolobium stellatum, Pyrus communis, Quercus spp., Rhaphiolepsis umbellata, 
Rhopalostylis sapida, Rhus natalensis, Ribes grossularia, Ribes spp., Robinia pseudoacacia, 
Rosa spp., Rubus spp., Salix spp., Schyzachyrium sanguineum, Sciadopitys verticillata, 

35 Sequoia sempervirens, Sequoiadendron giganteum, Sorghum bicolor, Spinacia spp., 
Sporobolus fimbriatus, Stiburus alopecuroides, Stylosanthos humilis, Tadehagi spp, Taxodium 
distichum, Themeda triandra, Trifolium spp., Triticum spp., Tsuga heterophylla, Vaccinium 
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spp., Vicia spp.Vitis vinifera, Watsonia pyramidata, Zantedeschia aethiopica, Zea mays, 
amaranth, artichoke, asparagus, broccoli, brussel sprout, cabbage, canola, carrot, cauliflower, 
celery, collard greens, flax, kale, lentil, oilseed rape, okra, onion, potato, rice, soybean, straw, 
sugarbeet, sugar cane, sunflower, tomato, squash, and tea, trees and algae amongst others. 
5 According to a preferred embodiment of the present invention, the plant is a crop plant such 
soybean, sunflower, canola, alfalfa, rapeseed, cotton, tomato, potato or tobacco. According to 
another preferred embodiment of the present invention, the plant is a monocotyledonous plant, 
such as sugar cane, further preferably a cereal, most preferably the plant is selected from the 
group consisting of rice, maize, wheat, barley, millet, rye or oats. 

10 

In a particular embodiment of the present invention, proteins of one plant species (for example 
Arabidopsis) are introduced in another plant species (for example rice). It has been shown in 
the present invention that plant growth characteristics are improved by introduction of a 
2xC2H2 zinc finger gene or protein from a dicot into a monocot. 

15 

According to a particular embodiment of the invention, there are provided methods as 
described above, wherein the plant is a monocot. More preferably the plant is rice or corn. 

Advantageously, performance of the methods according to the present invention leads to 
20 plants having modified growth characteristics. 

The term "growth characteristic" as used herein, preferably refers to anyone or more of, but is 
not limited to, yield, architecture and cycle time. 

The term "yield" means the amount of harvested material. For crop plants yield also means the 
25 amount of harvested material per acre of production. Depending on the crop the harvested part 
of the plant may be a different part or tissue of the plant, such as seed (e.g. rice, sorghum or 
com when grown for seed); total above-ground biomass (e.g. for com, when used as silage), 
root (e.g. sugarbeet), fruit (e.g. tomato), cotton fibers, or any other part of the plant which is of 
economic value. "Yield" also encompasses yield stability of the plants, meaning that year after 
30 year, one can obtain the same yield from the progeny of the plants, without too much 
interference of external factors, such as weather conditions. "Yield" also encompasses yield 
potential, which as the maximum obtainable yield. 

Yield maybe dependent on a number of yield components. The parameters for these 
components are well known by a person skilled in the art. For example breeders are well 
35 aware of the specific yield components and the corresponding parameters for the crop they are 
aiming to improve. 
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For example key yield components for corn Include number of plants per hectare or acre, 
number of ears per plant, number of rows (of seeds) per ear, number of kernels per row, and 
thousand kernel weight. For silage corn typical parameters are the above ground biomass and 
energy content. 

5 Key yield components for rice include number of plants per hectare or acre, number of 
panicles per plant, number of spikelets per panicle, seed filling rate (number of filled seeds) 
and thousand kernel weight. Preferentially methods for increasing yield of rice encompass 
increased number of flowers per panicle and an increased number of filled seeds. The 
parameter of increased total number of seeds may be linked to increased number of flowers. 
10 "Yield" further encompasses typical biomass components, such as above ground parts of a 
plant and the root system. General biomass parameters are area and dry weight. Specific 
parameters for above ground biomass further encompass above ground area and plant height. 
Specific parameters for the root system encompass root ratio, root length and penetration 
depth, root branching, root hair density, root pulling resistance and aerenchyma formation. 

15 

The plants of the present invention are characterized by increased number of filled seeds, 
increased total seed weight, increased total number of seeds and increased harvest index. 
Therefore the methods of the present invention are particularly favorable to be applied in 
cereals such as rice and corn (maize). Accordingly, a particular embodiment of the present 
20 invention relates to a method to increase yield of com, comprising modifying expression of a 
nucleic acid encoding a 2xC2H2 zinc finger protein. 



The plants of the present invention are characterized by an increase in thousand kernel weight 
and therefore the seed size or seed volume and/or the seed content and/or seed composition 
25 are altered by the methods of the present invention. The seeds provided by the methods of the 
present invention may have more nutritional value, more starch and/or more oil, possibly due 
to their bigger size. 



The plants of the present invention are characterized by more above ground area. Therefore, 
30 the methods of the present invention are particularly favorable for crops grown for their green 
tissue and/or grown for their above ground biomass. The methods of the present invention are 
particularly useful for grasses, forage crops (such as forage com (maize), clover, medicago 
etc.), trees, sugar cane etc. 



35 The improvement in yield as obtained by the methods of the invention, may be obtained as a 
result of improvement of one or more of the above mentioned yield components and/or 
parameters. 
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The term "architecture" as used herein encompasses the appearance or morphology of a 
plant, including any one or more structural features or combination of structural features 
thereof. Such structural features include the shape, size, number, position, texture, 
5 arrangement, and pattern of any cell, tissue or organ or groups of cells, tissues or organs of a 
plant, including the root, leaf, shoot, stem, petiole, trichome, flower, petal, stigma, style, 
stamen, pollen, ovule, seed, embryo, endosperm, seed coat, aleurone, fibre, cambium, wood, 
heartwood, parenchyma, aerenchyma, sieve element, phloem or vascular tissue, amongst 
others. Particular architectural characteristics that may be modified by the methods of the 

10 present invention are increased plant height, increased number or size of stems or stalks or 
tillers or panicles or pedicles, increased number or size of inflorescences, increased branching 
of for example of tassels and ears or altered flowering characteristics. A preferred architectural 
characteristic that may be modified by the methods of the present invention is leaf architecture. 
The term "leaf architecture" as used herein comprises typical leaf characteristics such as 

15 length, width, thickness, cell number, cell size and greenness. 

Typically, the plants of the present invention display increased leaf surface area and /or 
increased leaf blade width. This trait is particularly important as it allows the plant to optimize 
the shape of its leaf to maximize the area used for photosynthesis. For that purpose, preferably 
20 the leaf blade is widened, but alternatively, the leaves are longer or smaller or rounder. These 
effects may lead to more healthy plants. Alternatively, this trait attributes aesthetic properties to 
the plant such as greenness and stronger leafs. 

"cycle time" of the plant as used herein means the time wherein a plant reaches 90% of its 
25 . maximum total area. This parameter is an indication of the duration of the vegetative growth. 
Prolonged vegetative growth was only displayed in some of the plants according to the present 
invention and may be controlled by choice of the transformation event and/or by choice of the 
promoter driving the 2xC2H2 nucleic acid. For example this characteristic was not displayed 
when a seed-preferred promoter was used. 

30 

Other "growth characteristics" that may be improved by the methods of the present invention 
are growth rate, early vigour, modified Tmid, T90 or A42 or altered growth curve. 

It is clear from the data as presented in the examples that one or more of the growth 
35 characteristics as defined herein above, may be combined in one plant. Alternatively, 
depending on the chosen transformation event and/or depending on the promoter used, one 
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or more of these growth characteristics may be present or absent or more or less pronounced 
in the plant 

The methods of the present invention may also be used to confer stress tolerance to plants. In 
5 particular, a 2xC2H2 of the ST2 type may be used to confer to a plant salt stress tolerance 
and/or drought stress tolerance. According to a specirfic embodiment, a tissue preferred 
promoter, such as a seed-preferred promoter" is used in these methods. 

The present invention also relates to use of a nucleic acid sequence encoding a zinc finger 
10 protein and homologues, derivatives and active fragments thereof in modifying the growth 
characteristics of plants, preferably in increasing yield, further preferably increasing seed yield. 
The present invention also relates to use of a nucleic acid sequence encoding a 2xC2H2 zinc 
finger protein and homologues, derivatives and active fragments thereof and to the 2xC2H2 
zinc finger protein itself and to homologues, derivatives and active fragments thereof as a 
15 growth regulator. The sequences represented by SEQ ID NO 1, and portions thereof and SEQ 
ID NO 2, and homologues, derivatives and active fragments thereof are useful in modifying the 
growth characteristics of plants, as hereinbefore described. The sequences would therefore 
find use as growth regulators, such as herbicides or growth stimulators. The present invention 
also provides a composition comprising a protein represented by SEQ ID NO 2, or a 
20 homologue, derivative or active fragment thereof for the use as a growth regulator. A growth 
regulator is used herein as meaning a regulator that increased yield and is therefore also 
referred to as yield regulator. 

In particular, the present invention provides a yield regulating composition comprising a nucleic 
acid encoding a 2xC2H2 protein, and/ or comprising a 2xC2H2 protein, and/or comprising a 
25 construct as defined herein above. Such a yield regulating composition further comprises 
additives normally use in yield regulating compositions, such as a solvent or carrier. 

Conversely, the sequences according to the present invention may also be interesting targets 
for agrochemical compounds, such as herbicides or growth stimulators. Accordingly, the 
30 present invention encompasses use of a nucleic acid encoding a 2xC2H2 protein, of a 2xC2H2 
protein and/or of a construct as defined in any of claims 20 to 22 as target for an agrochemical , 
such as a herbicide or a growth stimulator. 

The methods according to the present invention may also be practised by co-expression of a 
gene encoding a 2xC2H2 zinc finger protein in a plant with at least one other gene that 
35 cooperates with the gene encoding a 2xC2H2 zinc finger protein. Such a gene may be a gene 
encoding a target protein of the 2xC2H2 zinc finger protein. Co-expression may be effected by 
cloning the genes under the control of a plant expressible promoter in a plant expressible 
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vector and introducing the expression vector(s) into a plant cell using Agrobacterium-medAated 
plant transformation. Therefore, the methods according to the present invention may result in 
plants having modified growth characteristics, particularly increased yield, as described 
hereinbefore in combination with other economically advantageous traits, such as further yield- 
5 enhancing traits, tolerance to various stresses, traits modifying various architectural features 
and/or biochemical and/or physiological features. 

Since the plants of the present invention have excellent growth characteristics and have high 
yield, they are suitable for the production of enzymes, pharmaceuticals or agrochemicals. Also, 
10 there are suitable to produce food or feed products. 

The invention clearly extends to enzymes, pharmaceuticals or agrochemicals as well as food 
or feed products isolated from these plants. 

Further a nucleic acid encoding a 2xC2H2 protein, a 2xC2H2 protein and/or the constructs of 
the present invention may be used breeding programs aiming at the development of plants 
15 with increased yield. 

Particularly, the use of allelic variants as defined above in particular conventional breeding 
programmes, such as in marker-assisted breeding is also encompassed by the present 
invention; this may be in addition to their use in the methods according to the present 

20 invention. Such breeding programmes sometimes require the introduction of allelic variations 
in the plants by mutagenic treatment of a plant One suitable mutagenic method is EMS 
mutagenesis. Identification of allelic variants then takes place by, for example, PCR. This is 
followed by a selection step for selection of superior allelic variants of the sequence in question 
and which give rise to altered growth characteristics in a plant. Selection is typically carried out 

25 by monitoring growth performance of plants containing different allelic variants of the sequence 
in question, for example, SEQ ID NO 1. Monitoring growth performance may be done in a 
greenhouse or in the field. Further optional steps include crossing plants in which the superior 
allelic variant was identified with another plant. This could be used, for example, to make a 
combination of interesting phenotypic features 

30 

According to another type of breeding programme a DNA marker is identified which may be 
genetically linked to a gene capable of modifying expression of a nucleic acid encoding a 
2xC2H2 zinc finger protein in a plant, which gene may be a gene encoding the 2xC2H2 zinc 
finger protein itself or any other gene which may directly or indirectly influence expression of 
35 the gene encoding a 2xC2H2 zinc finger protein and/or activity of the 2xC2H2 zinc finger 
protein itself. This DNA marker may then be used in breeding programs to select plants having 
altered growth characteristics. 
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The methods according to the present invention may also be practised by introducing into a 
plant at least a part of a (natural or artificial) chromosome (such as a Bacterial Artificial 
Chromosome (BAC)), which chromosome contains at least a gene encoding a 2xC2H2 zinc 
5 finger protein, optionally together with one or more related gene family members. Therefore, 
according to a farther aspect of the present invention, there is provided a method for modifying 
growth characteristics of plants by expressing in a plant at least a part of a chromosome 
comprising at least a gene encoding a 2xC2H2 zinc finger protein. 

10 The present invention will now be described with reference to the following figures in which: 

Fig. 1 is a map of an expression vector for the expression in plants of a 2xC2H2zinc finger 
protein under the control of a GOS2 promoter. CDS1536 is the internal code for the 
Arabidopsis thaliana salt tolerant zinc finger (STZ) protein cDNA. The zinc finger protein 

15 expression cassette has a GOS2 promoter and a double terminator sequence (T-zein and T- 
rbcS-deltaGA) located within the left border (LB repeat) and the right border (RB repeat) of the 
Ti plasmid. Cloned within these T-borders are also a screenable marker and a selectable 
marker, each under the control of a constitutive promoter (Prom), foil owed by a terminator 
sequence (poly a and t-NOS). Furthermore, this vector also contains an origin of replication 

20 (pBR322 (ori + bom)) for bacterial replication and a selectable marker (Sm/SpR) for bacterial 
selection. 

Fig. 2A shows digital images from a T1 positive line transformed with an STZ zinc finger 
transgene under control of a GOS2 promoter and Fig. 2B shows digital images of 
25 corresponding nullizygotes plants. 

Fig. 3 lists sequences useful in the methods of the present invention. SEQ ID NO 1 is an STZ 
encoding nucleic acid isolated from Arabidopsis thaiiana; the start and the stop codon are 
highlighted in bold. SEQ ID NO 2 is the STZ protein sequence encoded by SEQ ID NO 1. In 

30 the STZ protein the nuclear localization signal also called the KRS motif or B-box is annotated 
(bold, italics, underlined), as well as the L-box (bold, underlined), the EAR motif (bold, italics), 
and the two C2H2 zinc finger domains with QALGGH motif (bold and boxed). SEQ ID NO 10 
to SEQ ID NO 25 provides the sequences of various orthologs of the Arabidopsis thaliana STZ 
protein from other plant species. SEQ ID NO 26 to SEQ ID NO 35 provides the sequences of 

35 various paralogs (from Arabidopsis ) of the STZ protein. SEQ ID NO 36 to SEQ ID NO 50 
provides the sequences of related 2xC2H2 genes and proteins usefril in the methods of the 
present invention. 



34 



WO 2004/058980 



PCT/EP2003/051104 



Fig. 4 is a photograph of T3 plants grown in a greenhouse (A) or in a field (B). The photograph 
shows yield increase ( especially in aboveground biomass and plant height) in subsequent 
generations of STZ transformed plants. 

5 

Fig. 5 shows the binary vector for expression in Oryza sativa of the Arabidops/s thaliana STZ 
gene (CDS1536) under the control of a seed preferred WSI18 promoter (PRO0151). This 
vector contains a T-DNA derived from the Ti Plasmid, limited by a left border (LB repeat, LB Ti 
C58) and a right border (RB repeat, RB Ti C58)). 

10 The zinc finger protein expression cassette has a WSI18 (PRO0151) promoter and a double 
terminator sequence (T-zein and T-rbcS-deltaGA) located within the left border (LB repeat) 
and the right border (RB repeat) of the Ti plasmid. Cloned within these T-borders are also a 
screenable marker and a selectable marker, each under the control of a constitutive promoter 
(Prom), followed by a terminator sequence (poly a and t-NOS). Furthermore, this vector also 

15 contains an origin of replication (pBR322 (on + bom)) for bacterial replication and a selectable 
marker (Sm/SpR) for bacterial selection. 

Examples 

The present invention will now be described with reference to the following examples, which 
20 are by way of illustration alone. 

DNA Manipulation 

Unless otherwise stated, recombinant DNA techniques are performed according to standard 
protocols described in Sambrook (2001) Molecular Cloning: a laboratory manual, 3rd Edition 
25 Cold Spring Harbor Laboratory Press, CSH, New York or in Volumes 1 and 2 of Ausubel etal. 
(1988), Current Protocols in Molecular Biology, Current Protocols. Standard materials and 
methods for plant molecular work are described in Plant Molecular Biology Labfase (1993) by 
R.D.D. Croy, published by BIOS Scientific Publications Ltd (UK) and Blackwell Scientific 
Publications (UK). 

30 

Example 1 : Gene Cloning 

A gene encoding an STZ protein was amplified by PCR from an Arabidopsis thaliana seedling 
cDNA library (Invitrogen, Paisley, UK). After reverse transcription of RNA extracted from 
seedlings, the cDNAs were cloned into pCMV Sport 6.0. Average insert size of the bank was 
35 1.5 kb, and original number of clones was of 1.59x10 7 cfu. Original titer was determined to be 
9.6x10 5 cfu/ml, after first amplification of 6x1 0 11 cfu/ml. After plasmid extraction, 200 ng of 
template was used in a 50pl PCR mix. Sequences of the primers used for PCR amplification 
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were, including the attB sites for Gateway recombination (in bold) were PRM3204 (sense, start 
codon in italics) 5' GGGG ACAAGTTTGTACAAAAAAGCAGGCTTCACA/A 7GGCG 
CTCGAGGCTC 3' (SEQ ID NO 3) and PRM3205 (reverse, complementary stop codon in 
italics) 5' GGGGACCACTTTGTACAAGAAAGCTGGGTAATTTCC7T>4AAGTTGAAGTTTGA 
5 3' (SEQ ID NO 4). 

PGR was performed using Hifi Taq DNA polymerase in standard conditions. The PGR 
fragment (CDS1536) was amplified and purified using standard methods. The first step of the 
Gateway procedure, the BP reaction, was then performed, during which the PCR fragment 
10 was recombined in vivo with the pDONR plasmid to produce, according to Gateway 
terminology, an "entry clone", p3359. PDONR was purchased from Invitrogen, as part of the 
Gateway technology. 

Example 2: Vector construction for rice transformation with pGOS2::A*STZ 

15 The entry clone p3359 was subsequently used in an LR reaction with p0640, a destination 
vector used for rice transformation. This vector contains as functional elements within the T- 
DNA borders a plant selectable marker and a Gateway cassette intended for LR in vivo 
recombination with the sequence of interest already cloned in the donor vector. Upstream of 
this Gateway cassette lies the rice GOS2 promoter for constitutive expression of the zinc finger 

20 gene (De Pater ei a/., Plant J. 2 (6) 837-844, 1992). After the recombination step, the resulting 
expression vector with the expression cassette CD4398 (Figure 1) was transformed into 
Agrobacterium strain LBA4404 and subsequently into plants. Transformed rice plants were 
allowed to grow and then examined for various parameters as described in Example 3. 

25 Example 3: Evaluation of TO, T1 and T2 transgenic rice plants transformed with 
pGOS2::A*STZ (CD4398) 

Approximately 15 to 20 independent TO transformants were generated. The primary TO 
transformants were transferred from tissue culture chambers to a greenhouse for growing and 
harvest of T1 seed. Six events of which the T1 progeny segregated 3:1 for presence/absence 

30 of the transgene were retained. For each of these events, approximately 10 T1 seedlings 
containing the transgene (hetero- and homo-zygotes), and approximately 10 T1 seedlings 
lacking the transgene (nullizygotes), were selected by PCR. Based on the results of the T1 
evaluation three events were chosen, for further characterisation in the T2 generation, one 
event being very positive for a number of parameters, a second event being positive for a 

35 number of parameters, but less pronounced, and a third event being neutral. Seed batches 
from the positive plants (both hetero- and homozygotes) in T1 , were screened by monitoring 
marker expression. For each chosen event, the heterozygote seed batches were then selected 
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for T2 evaluation. An equal number of positives and negatives within each seed batch were 
transplanted for evaluation in the greenhouse (i.e., for each event 40 plants were grown of 
which there were about 20 positives for the transgene and about 20 negative). Therefore, the 
total number for the three events amounted to 120 plants for evaluation in the T2 generation. 

5 

T1 and T2 plants were transferred to the greenhouse and evaluated for vegetative growth 
parameters and seed parameters, as described hereunder. 

(I) Statistical analysis of phenotvpic characteristics 

10 A two factor ANOVA (analyses of variance) was used as statistical model for the overall 
evaluation of plant phenotypic characteristics. An F-test was carried out on all the parameters 
measured, for all the plants of all the events transformed with the gene of interest. The F-test 
was carried out to check for an effect of the gene over all the transformation events and to 
verify an overall effect of the gene or "global gene effect" . Significant data, as determined by 

15 the value of the f-test, indicates a "gene" effect, meaning that the phenotype observed is 
caused by more than the presence or position of the gene. In case of the F-test, the threshold 
for significance for a global gene effect is set at 5% probability level. 

To check for an effect of the genes within an event, i.e., for a line-specific effect, a t-test was 
20 performed within each event using data sets from the transgenic plants and the corresponding 
null plants. "Null plants" or "Null segregants"are the plants treated in the same way as the 
transgenic plant, but from which the transgene has segregated. Null plants can also be 
described as homozygous negative transformant plants. The threshold for significance for the 
t-test is set at 10% probability level. Within one population of transformation events, some 
25 events may be under or above this t-test threshold. This is based on the hypothesis that a 
gene might only have an effect in certain positions in the genome, and that the occurrence of 
this position-dependent effect is not uncommon. This kind of gene effect may also be referred 
to as a "line effect of a gene". The p- value is obtained by comparing the t-value to the t- 
distribution or alternatively, by comparing the F-value to the f-distribution. The p- value stand 
30 for the probability that the null hypothesis (null hypothesis being "there is no effect of the 
transgene") is correct. 

CI) Vegetative growth measurements 

The selected plants were grown in a greenhouse. Each plant received a unique barcode label 
35 to link unambiguously the phenotyping data to the corresponding plant. The selected plants 
were grown on soil in 10 cm diameter pots under the following environmental settings: 
photoperiod= 11.5 h, daylight intensity^ 30,000 lux or more, daytime temperature= 28°C or 
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higher, night time temperature= 22°C, relative humidity= 60-70%. Transgenic plants and the 
corresponding nullizygotes were grown side-by-side at random positions. From the stage of 
sowing until the stage of maturity (which is the stage were there is no more increase in 
biomass) the plants were passed weekly through a digital imaging cabinet (examples of 
5 pictures are shown in Figures 2A and 2B). At each time point digital images (2048x1536 pixels, 
16 million colours) were taken of each plant from at least 6 different angles. The parameters 
described below were derived in an automated way from the digital images using image 
analysis software. 

1 0 (a) Aboveground area 

Plant above ground area was determined by counting the total number of pixels from 
aboveground plant parts discriminated from the background. This value was averaged for the 
pictures taken on the same time point from the different angles and was converted to a 
physical surface value expressed in square mm by calibration. Experiments show that the 

15 aboveground plant area measured this way correlates with the biomass of plant parts above 
ground. 

Results of the maximum above ground area values of the lines selected for T2 evaluation are 
summarized in Table 1. The plants of the best performing line showed an increase in biomass 
20 of 34 % , compared to the nullizygotes. 

When an F-test was carried out on all the plants of all the T2 events it became clear that the 
transgenic plants show a significant increase in above ground area, in average an increase of 
approximately 18%. A significant increase in above ground biomass is also displayed by STZ 
transformed plants grown under field conditions (see figure 4). 

25 

Table 1 : Aboveground area of STZ transgenic T2 plants. Each row corresponds to one event, 
for which the average maximum aboveground area (expressed in mm 2 ) was determined for the 
transgenics (TR) and the nuii plants (null). The difference in absolute values between the 
transgenic population and the nullizygotes of each event are presented (dif.) as well as the 
30 percentage of difference between the two populations (% dif). P stands for the probability 
produced by the t-test for each event The last row presents the average numbers calculated 
from all the events. Here the p-value is produced by the F-test. 
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Total above ground Area Max (mm*) 


Une 


TR 


null 


dif 


% dif 


p-value 


CD4396 LI 


63947 


47606 


16341 


34 


0-0021 


CD4396 L2 


42509 


41342 


1167 


3 


0.8063 


CD4396 L3 


41116 


33687 


7429 


22 


0.1107 


Overall 


49178 


41657 


7522 


18 


0.0047 



(b) Plant height measurements 

Plant height was determined by the distance between the horizontal lines going through the 
5 upper pot edge and the uppermost pixel corresponding to a plant part above ground. This 
value was averaged for the pictures taken on the same time point from the different angles and 
was converted, by calibration, to a physical distance expressed in mm. Experiments showed 
that plant height measured this way correlate with plant height measured manually with a ruler. 

10 The increase in plant height was displayed very clearly in STZ transformed plants when 
measured at the end of the vegetative growth (see figure 4A). Also, this parameter, was 
displayed by STZ transformed plants when grown in the field (see figure 4B) at the time of 
harvest. 

15 (c) Total area cycle time measurements 

Plants were imaged weekly along the complete cell cycle and the maximum total area of the 
plants was determined as mentioned above. Total Area Cycle Time is the time when a plant 
reaches 90% of its maximum total area. This parameter is an indication of the duration of the 
vegetative growth. 

20 Only in some transgenic lines there was an effect on cycle time. These few lines showed a 
prolonged vegetative growth. 

fill) Measurement of seed-related parameters 

The mature primary panicles were harvested, bagged, barcode-labelled and then dried for 
25 three days in the oven at 37°C. The panicles were then threshed and all the seeds collected. 
The filled husks were separated from the empty ones using an air-blowing device. After 
separation, both seed lots were then counted using a commercially available counting 
machine. The empty husks were discarded. The filled husks were weighed on an analytical 
balance and the cross-sectional area of the seeds was measured using digital imaging. This 
30 procedure resulted in the set of seed-related parameters described below. 
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(a) Total number of filled seeds per plant 

The number of filled seeds was determined by counting the number of filled husks that 
remained after the separation step. 

5 Total numbers of filled seeds per plant are summarized in Table 2. The t-test shows that 
for two events, transgenic plants produce 106% and 130% more filled seeds than the 
nullizygotes. 

Table 2: Number of filled seeds of STZ transgenic T2 plants. Each row corresponds to one 
10 event, for which the average number of filled seeds was determined for the transgenics (TR) 
and the null plants (null). The difference in absolute values between the transgenic population 
and the nullizygotes of each event are presented (dif.) as well as the percentage of difference 
between the two populations (% dif). P stands for the probability produced by the t-test for 
each event The last row presents the average numbers calculated from all the events. Here 
1 5 the p-value is produced by the F-test. 



Number of filled seeds 


Une 


TR 


null 


dif 


% dif 


p-value 


CD4396 LI 


387.9 


188.7 


199.19 


106 


<0.0001 


CD4396 L2 


163.8 


156.5 


7.22 


5 


0.8382 


CD4396 L3 


236.9 


102.9 


133.98 


130 


0.0004 


Overall 


264.9 


159.7 


105.25 


66 


<0.0001 



(b) Total seed weight per plant 

The total seed weight was measured by weighing all filled husks harvested from a plant. 

20 The total seed weight values of STZ transformed plants are summarized in Table 3. STZ 
transgenic plants produce significantly more seed weight than the corresponding nullizygotes. 
The difference in seed weight of the transgenics may be as high as 138% or higher. 

Table 3: Total seed weight per plant of STZ transgenic T2 plants. Each row corresponds to 
25 one event for which the average total seed weigh (in gram) was determined for the 
transgenics (TR) and the null plants (null). The difference in absolute values between the 
transgenic population and the nullizygotes of each event are presented (dif) as well as the 
percentage of difference between the two populations (% dif). P stands for the probability 
produced by the t-test for each event. The last row presents the average numbers calculated 
30 from all the events. Here the p-value is produced by the F-test. 
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Total weight of seeds 


Line 


TR 


null 


dif 


% dif 


p-value 


CD4396 LI 


9.8 


4,5 


5.25 


116 


<0.0001 


CD4396 L2 


3.4 


3.3 


0.1 


3 


0.908 


CD4396 L3 


6.1 


2,6 


3.56 


138 


0.0001 


Overall 


6.5 


3.7 


2.75 


74 


<0.0001 



(c) Harvest index 

The harvest index in the present invention is defined as the ratio between the total seed yield 
and the above ground area (mm 2 ), multiplied by a factor 10 6 . 

5 

The harvest index values of the STZ-transgenic plants are summarized in Table 4. STZ 
transgenic plants have a significant increase in harvest index. The increase in harvest index of 
the transgenic plants may be as high as 66%, when compared to the corresponding 
nullizygotes. 

10 

Table 4: Harvest index of STZ transgenic T2 piants. Each row corresponds to one event, for 
which the average harvest index was determined for the transgenics (TR) and the nuii plants 
(null). The difference in absolute values between the transgenic population and the 
nullizygotes of each event are presented (dif.) as well as the percentage of difference between 
15 the two populations (% dif). P stands for the probability produced by the t-test for each event 
The last row presents the average numbers calculated from all the events. Here the p-value is 
produced by the F-test 



Harvest index 


Line 


TR 


null 


dif 


% dif 


p -value 


CD4396 LI 


149.1 


90 


59.11 


66 


<0.0001 


CD4396 L2 


74 


73.4 


0.55 


1 


0.9574 


CD4396 13 


121.3 


75.9 


45.32 


60 


<0.0001 


Overall 


114.8 


82.6 


32.16 


39 


<0.0001 



(d) Thousand kernel weigth (TKW) of plants 
20 Thousand Kernel Weight (TKW) is a parameter extrapolated from the number of filled seeds 
counted, and their total weight. 

The weight values of thousand kernels of STZ transgenic plants are presented in Table 5. STZ 
transgenic plants have increased thousand kernel weight. The increase of TKW of transgenic 
plants may be as high as 6% when compared to the corresponding nullizygotes. 
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Table 5: Thousand kernel weight of STZ transgenic T2 plants. Each row corresponds to one 
event, for which the average TKW was determined for the transgenics (TR) and the nuil plants 
(null). The difference in absolute values between the transgenic population and the 
5 nullizygotes of each event are presented (dif.) as well as the percentage of difference between 
the two populations (% dif). P stands for the probability produced by the t-test for each event 
The last row presents the average numbers calculated from all the events. Here the p-value is 
produced by the F-test. 



TKW 


Line 


TR 


null 


dif 


% dif 


p-vaiue 


CD4396 LI 


25.2 


23.8 


1.46 


6 


0.0128 


CD4396 L2 


20.6 


20.7 


-0.14 


-1 


0.7963 


CD4396 L3 


25.5 


24.5 


0.99 


4 


0.0812 


Overall 


23.7 


23 


0.71 


3 


0.0213 



1 0 (e) Total number of seeds 

The total number of seeds per pJant was measured by counting the number of husks harvested 
from a plant. 

The total numbers of seeds per plant are summarized in Table 6. STZ transformed plants have 
15 an increase in total number of seeds. The increase of total number of seeds may be as high as 
68%, when compared to the corresponding nullizygotes. 

Table 6: Total number of seeds of STZ transgenic T2 plants. Each row corresponds to one 
event, for which the average total number of seeds was determined for the transgenics (TR) 
20 and the null plants (null). The difference in absolute values between the transgenic population 
and the nullizygotes of each event are presented (dif.) as well as the percentage of difference 
between the two populations (% dif). P stands for the probability produced by the t-test for 
each event. The last row presents the average numbers calculated from all the events. Here 
the p-value is produced by the F-test. 



Total number of seeds 


Line 


TR 


null 


dif 


% dif 


p-value 


CD4396 LI 


483.5 


367.4 


116.03 


32 


0.0146 


CD4396 L2 


353.9 


327.5 


26.42 


8 


0.5473 


CD4396 L3 


383.6 


228.2 


155.48 


68 


0.0009 


Overall 


406 


312.5 


93.52 


30 


0.0002 
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Conclusion 

It may be concluded that vegetative growth is increased in the STZ transgenic plants when 
compared to the control non -transgenic plants, as reflected by parameters such as above 
5 ground area, where the increase is above 20 %. This effect may be attributed to the 
expression of the STZ gene in the transgenic plants. Additionally, in some transformation 
events, the length of the vegetative growth is altered in the STZ transgenic plants. For those 
transformation events in which this effect occurs, in average the vegetative growth was 
prolonged with about 4 to 6 days, under the conditions tested. 

10 Furthermore, yield was increased in STZ transgenic plants. Several seed parameters reflect 
this yield increase. The total number of seeds harvested was at least 100% higher in the 
transgenics than in the control plants, for those events showing a differential. For these 
events, there was also an increase in the total number of seeds of the transgenics, which 
increase was higher than 30 %. Seed filling in those transgenics was greatly improved, 

15 reaching differences above 100% in the number of filled seeds. 

Seed of the transgenic plants were also heavier, and probably bigger, as suggested by the 
higher values obtained for the thousand kernel weigh. The TKW parameter is a very stable 
parameter in rice cultivars, such as nipponbare, and in the growth conditions here used. This 
means that this parameter is not easily influenced and makes it an important yield parameter. 

20 Therefore a TKW increase of 6 % represents a significantly increase in yield. 

Harvest index, another important yield parameter, was increased in the transgenic plants with 
more than 50 %. 

In summary, based on the evaluation of STZ transgenic plants in the T1, T2 and further 
generations, it may be concluded that the presence of an STZ transgene, has a positive effect 
25 on the size of the plant and/or its organs, as well as a positive effect on the final yield 
harvested. 

(Hi) Root growth measurements 

Transgenic plants are grown next to their corresponding non-transgenic null segregant in 
transparent pots. In average, for each construct comprising a particular promoter~2xC2H2 
30 combination, a minimum of 5 independent transformation events are evaluated for root growth, 
root development and root architecture. Typically, per transformation event, 10 transgenics are 
compared to 10 nullizygotes. Root pictures are taken weekly during plant growth. The pictures 
are processed and analyzed to extract the values for the root parametes as detailed below. 
Statistical analysis as described above are applied to these data. 

35 
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a) Root Area 

Total root area is calculated from the summed number of pixels of each root images. A positive 
linear correlation between root area and dry weight and root biomass of the root has been 
previously established by similar experiments. Therefore, root area is a good approximation for 
5 root biomass. 

b) Root Length 

The total perimeter of the roots of a plant is calculated as the sum of the perimeter of all roots 
in the images. A linear correlation between this measurement and root length has been 
10 previously established. Thus, root length is extrapolated from the total root perimeter. 

c) Root Width 

Average root width of a plant is expressed as the ratio between the Root Area and the Root 
Length. 

15 

STZ transgenic plants of the invention show a superior performance when compared to control 
pants. Transgenic plants are altered in one or more the root parameters detailed above. In 
particular the transgenic have increased root biomass, for example due to increased root dry 
weight or area, and/or increased root length and/or increased root width. 

20 

Example 4: Leaf Blade Width Measurement 

Leaves of STZ transgenic plants appeared bigger and wider when compared to the 
corresponding control non-transgenic plants. To quantity the increase in Jeaf width, leaf blade 
width (length of transversal axe) of the flag leaf was measured with a ruler at the widest point 
25 of the leaf, which is approximately at half of the length, in plants that have reached the end of 
the vegetative growth phase. The results shown in the Table 7, indicate that the increase in 
the leaf blade width in at least the event here measured was around 15 % when compared to 
the corresponding nullizygote. 

30 Table 7: Leaf blade width of STZ transgenic T2 plants. The average leaf blade width was 
determined for the transgenics (TR) and the null plants (null) of the selected event The 
difference in absolute values between the transgenic population and the nullizygotes of the 
event is presented (dif.) as well as the percentage of difference between the two populations 
(% dif). P stands for the probability produced by the t-test . 

35 
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Leaf blade width 


Une 


TR 


null 


dif 


% dif 


p-value 


CD4396 LI 


1.56 


1.35 


0.21 


15 


0.098 



Example 5: Vector construction for rice transformation with pWSI18::AtSTZ 

Vector construction for transformation with the pWSI18 (PRO0151) - AtSTZ (CDS1536) 
5 cassette was carried out essentially as in example 2. The entry clone p3359, described earlier, 
was subsequently used in an LR reaction with p05653, a destination vector used for rice 
transformation. This destination vector contains as functional elements within the T-DNA 
borders a plant selectable marker and a Gateway cassette intended for LR in vivo 
recombination with the sequence of interest already cloned in the donor vector. A WSI18 
10 promoter for seed preferred expression (PRO0151) is located upstream of this Gateway 
cassette. After the recombination step, the resulting expression vector with the expression 
cassette CD4398 (Figure 5) was transformed into Agrobacterium strain LBA4404 and 
subsequently this vector was transformed to Oryza sativa plants. Transformed rice plants were 
allowed to grow and then examined for various parameters as described in example 3. 

15 

Example 6: Evaluation of TO and T1 transgenic rice plants transformed with the seed 
preferred expression cassette pWS1 18:: AtSTZ (CD4398) 

Preparations of calli and of the Agrobacterium tumefaciens strain containing the expression 
vector with the CD4398 expression cassette, were carried out as described in example 3, as 
20 were the calli transformation and plant regeneration. 

Approximately 15 to 20 independent TO rice transformants were generated. The primary 
transformants were transferred from tissue culture chambers to a greenhouse for growing and 
harvest of T1 seed. Events, of which the T1 progeny segregated 3:1 for presence/absence of 
25 the transgene, were retained. For each of these events, approximately 10 T1 seedlings 
containing the transgene (hetero- and homo-zygotes), and approximately 10 T1 seedlings 
lacking the transgene (nullizygotes), were selected by monitoring marker expression. 
Transgenic plantlets were grown next to control nullyzygotes, seeds were harvested and 
thousand kernel weight determined as previously described. 

30 

Transformed plants comprising the expression cassette CD8490 (seed preferred 
pWSI18::STZ), had a normal and healthy appearance and were harvested at the same time as 
the control plants. The seeds harvested from the transgenic plants had an increase in 
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thousand kernel weight when compared to the control plants. As shown in Table 8 increase in 
thousand kernel weight was above 10%. 

Table 8: Thousand kernel weight of STZ transgenic T1 plants. The average I thousand kernel 
5 weight was determined for the transgenics (TR) and the null plants (null) of the selected event 
The difference in absolute values between the transgenic population and the nullizygotes of 
the event is presented (dif.) as well as the percentage of difference between the two 
populations (% dif). P stands for the probability produced by the t-test . 



Thousand kernel weight 


Line 


TR 


null 


dif 


% dif 


p-value 


CD8490 LI 


29.6 


26.8 


2.82 


11 


0.001 



10 

Example 7: Cloning, transformation and evaluation of other 2xC2H2 encoding genes. 

In Table 9 an overview is given of constructs with STZ or other 2xC2H2 zinc finger proteins, 
under control of various promoters, which constructs are made for use in the methods of the 
present invention. The coding regions of the 2xC2H2 genes to be cloned (GOI, Gene of 

15 Interest) are amplified by PCR from cDNA , following the protocol as in Example 1. Specific 
primers for each 2xC2H2 gene were designed at the start and stop codons of the gene 
sequence as present in the public database under the accession number as indicated in Table 
9. These cloned sequences are also herein incorporated under the SEQ ID NO number as 
mentioned in the table. Moreover, the isolated PCR fragments were also given a unique CDS 

20 number. 

The PCR fragment with a 2xC2H2 gene is then cloned under the control of a particular 
promoter. Different combinations for different genes are made (see Table 9). Chimeric 
constructs are made and CD numbers represent bacterial strains carrying the chimeric 
construct. Corresponding transgenic plants are obtained by transforming the plants with the 
25 chimeric constructs, following the protocols as mentioned herein before. Evaluation of the 
transgenic events reveals an increase in yield, and increase in leaf surface area and/or an 
increase in duration of vegetative growth in the transgenic plants when compared to the control 
non-transgenic plants. 

30 
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CD-070-PCT 



Table 9: examples of 2xC2H2 chimeric constructs useful for the methods of the present invention. *see T* 



CDS 


Accession number 
(cDNA on which primers 
were designed to amplify 
the CDS region) 


Prot ACC number 


SEQ ID NO 


PRO0129* 


PRO0170* 


PROQ0$1_2* 


PRO0123* 


PRO0207* 


PRO011I 


CDS1536 
STZ 

Arabidopsis 


X95573 


CAA64820 


1 +2 


CD4398 


CD11371 


CD11382 


CD10960 


CD10959 


CD1031; 


CDS2200 

Paralog 

Arabidopsis 


AF022658 NIVM20516 


AAB80922.1At5g04340 


28 + 29 


CD11576 






CD11413 




CD1154< 


CDS2205 

Paralog 

Arabidopsis 


NMJ 23683 


At5g43170 


32 + 33 


CD11325 






CD11414 




CD1138' 


CDS2775 
Ortholog 
Oryza 
sativa 


AF332876 


AAK01713.1 


36 + 37 


CD09948 










CD103H 


CDS1677 
Homolog 
Arabidopsis 


AL1 32966 REGION: 
116202..1 16729 


CAB67667 


38 + 39 


CD06462 






CD 




CD 


CDS3337 
Homolog 
Sugarcane 


CA279020 




40 


CD 






CD 




CD 


CDS2416 
Homolog 
Arabidopsis 


AF254447 


At3g57670 


41 +42 


CD 






CD 




CD 


CDS2377 
Homolog 
Arabidopsis 


AJ311810 


CAC86167 


43 + 44 


CD 






CD 




CD 


CDS 

Homolog 

Arabidopsis 


AL355775 REGION: 
compfement(7957..8451 ) 


CAB90935 


45 + 46 


CD 






CD 




CD 


CDS 

Homolog 

Arabidopsis 


AL391143 REGION: 
comp!ement(31 730..32938) 


CAC01747 


47 + 48 


CD 






CD 




CD 


CDS3641 
Homolog 
Arabidopsis 


X98678 


CAA67236 


49 + 50 


CD 






CD 




CD 
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Table 10: examples promoters used in combination with 2xC2H2 for the methods of the 
present invention. 



Promoter 


Preferred expression type 


Origin species 


Gene 


PRO0151 


Seeds (mainly embryo and 
aleurone). Strong expression. 


Oryza sativa 


WSI18 


PRO0110 


Root 


Oryza sativa 


RCc3 


PRO0207 


Green tissue. Moderate expresssion 
levels 


Saccharum officinarum 


Prp 


PRO0123 


Green tissue. Strong expression 
levels. 


Oryza sativa 


Protochlorophyllide 
reductase 


PRO0090 


Seed specific (mainly endosperm) 


Oryza sativa 


Prolamin RP6 


PRO0170 


Constitutive. Strong Expression. 


Oryza sativa 


High Mobility 
Group protein 


PRO0218 


Seeds (mainly embryo and 
aleurone) 


Oryza sativa 


oleosine 18kda 


PRO0061_2 


Young expanding tissues 


Oryza sativa 


beta-expansine 
EXPB9 


PRO0129 


Constitutive. High expression levels. 


Oryza sativa 


GOS2 



5 

Example 8: use of the invention in corn. 

The methods of the invention described herein are also used in maize. To this aim, an STZ 
encoding gene, for example a maize or other STZ ortholog, is cloned under control of a 
promoter operable in maize, in a plant transformation vector suitable for Agrobacterium- 
10 mediated corn transformation. Methods to use for corn transformation have been described in 
literature (Ishida et al., Nat Biotechnol. 1996 Jun;14(6):745-50; Frame et al., Plant Physiol. 
2002 May;129(1):13-22). 

Transgenic plants made by these methods are grown in the greenhouse for T1 seed 
15 production. Inheritability and copy number of the transgene are checked by quantitative real- 
time PCR and Southern blot analysis and expression levels of the transgene are determined 
by reverse PCR and Northern analysis. Transgenic lines with single copy insertions of the 
transgene and with varying levels of transgene expression are selected for T2 seed 
production. 

20 

Progeny seeds are germinated and grown in the greenhouse in conditions well adapted for 
maize (16:8 photoperiod, 26-28°C daytime temperature and 22-24°C nighttime temperature) 
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as well under water-deficient, nitrogen-deficient, and excess NaCI conditions. Null segregants 
from the same parental line, as well as wild type plants of the same cultivar are used as 
controls. The progeny plants resulting from the setting or the crosses are evaluated on different 
biomass and developmental parameters, including, plant height, stalk/stem thickness, stem 
5 size, number of leaves, total above ground area, leaf greenness, time to maturity, time to 
silking, flowering time, time to flower, ear number, ear length, row number, kernel number, 
kernel size , kernel oil content, grain maturity, harvesting time. The seeds of these lines are 
also checked on various parameters, such as grain size, total grain yield per plant, and grain 
quality (starch content, protein content and oil content). 

10 

Lines that are most significantly improved compared to corresponding control lines are 
selected for further field-testing and marker-assisted breeding, with the objective of transferring 
the field-validated transgenic traits into commercial germplasm. The testing of maize for 
growth and yield-related parameters in the field is conducted using well-established protocols. 
15 The corn plants are particularly evaluated on yield parameters, such as for example, amount of 
plants per acre, amount of ears per plant, amount of rows per ear, amount of seeds per row 
and TKW. Subsequent improvements for introgressing specific loci (such as transgene 
containing loci) from one germplasm into another is also conducted using well-established 
protocols. 

20 
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Claims 

1. Method for increasing plant yield relative to corresponding wild type plants, comprising 
modifying expression in a plant of a nucleic acid sequence encoding a 2xC2H2 zinc finger 

5 protein and/or modifying in a plant level and/or activity of a 2xC2H2 zinc finger protein. 

2. Method for increasing leaf surface area relative to corresponding wild type plants, 
comprising modifying expression in a plant of a nucleic acid sequence encoding a 2xC2H2 
zinc finger protein and/or modifying in a plant level and/or activity of a 2xC2H2 zinc finger 

10 protein. 

3. Method for prolonging vegetative growth phase of a plant relative to corresponding wild 
type plants, comprising modifying expression in a plant of a nucleic acid sequence 
encoding a 2xC2H2 zinc finger protein and/or modifying in a plant level and/or activity of a 

15 2xC2H2 zinc finger protein. 

4. Method according to any of claims 1 to 3, wherein said modifying expression, level and/or 
activity is effected by recombinant means and/or chemical means. 

20 5. Method according to any of claims 1 to 4, wherein said 2xC2H2 zinc finger protein 
comprises a QALGGH motif. 

6. Method according to any of claims 1 to 4, wherein said 2xC2H2 zinc finger protein 
comprises a NNM(W)QMH motif. 

25 

7. Method according to any of claims 1 to 6, wherein said 2xC2H2 zinc finger protein 
comprises an EAR motif. 

8. Method according to any of claims 1 to 7, wherein said 2xC2H2 zinc finger protein further 
30 comprises a B-box. 

9. Method according to any of claims 1 to 8, wherein said 2xC2H2 zinc finger protein further 
comprises an L-box. 

35 10. Method according to any of claims 1 to 9, wherein said 2xC2H2 zinc finger protein is 
derived from a dicotyledonous plant, preferably from the family Brassicaceae, further 
preferably from Arabidopsis thaliana, more preferably the nucleic acid is as represented by 
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SEQ ID NO 2 or a homologue, derivative or active fragment thereof and/or wherein said 
nucleic acid is as represented by SED ID NO 1 or a portion thereof or sequences capable 
of hybridising therewith . 

5 11. Method according to claim 10, wherein said homologue, derivative or active fragment has, 
in increasing order of preference, at least 30%, 31%, 32%, 33%, 34%, 35%, 36%, 37%, 
38%, 39%, 40%, 41%, 42%, 43%, 44%, 45%, 46%, 47%, 48%, 49%, 50%, 52%, 54%, 
56%, 58%, 60%, 62%, 64%, 66%, 68%, 70%, 72%, 74%, 76%, 78%, 80%, 82%, 84%, 
86%, 88%, 90%, 92%, 94%, 96%, 98% sequence identity with the sequence of SEQ ID NO 
10 2. 

12. Method according to any of claims 1 to 1 1 , wherein said plant is a monocot 

13. Method according to any of claims 1 to 12, wherein said modifying expression is effected 
15 by introducing into a plant a nucleic acid capable of modifying expression of a gene 

encoding a 2xC2H2 zinc finger protein and/or capable of modifying level and/or activity of a 
2xC2H2 zinc finger protein. 

14. Method according to claim 13, wherein said nucleic acid capable of modifying expression is 
20 a nucleic acid encoding a 2xC2H2 protein, such as a 2xC2H2 protein as defined in any of 

claims 5 to 11. 

15. Method according to claims 13 or 14, wherein said nucleic acid introduced into a plant is an 
alternative splice variant of a nucleic acid as defined in claim 14. 

25 

16. Method according to claims 13 or 15, wherein said nucleic acid introduced into a plant is an 
allelic variant of a nucleic acid as defined in claim 14. 

17. Method according to claims 13 or 16, wherein said nucleic acid introduced into a plant is 
30 comprised on at least part of a chromosome. 

18. Method according to any of claims 1 to 17, wherein said modifying expression comprises 
increased expression. 

35 19. Method according to any of claims 1 to 18, wherein expression of said nucleic acid is 
driven by a plant promoter, preferably a constitutive promoter, such as a GOS2 promoter. 
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20. Method according to any of claims 1 to 18, wherein expression of said nucleic acid is 
driven by a plant promoter, preferably a tissue preferred promoter, such as seed-preferred 
promoter. 

5 21 Method according to any of claims 1 to 20, wherein said increased yield comprises 
increased above ground biomass. 

22. Method according to any of claim 1 to 20, wherein said increased yield comprises 
increased seed yield. 

10 

23. Method according to any of claim 1 to 20, wherein said increased yield comprises 
increased root yield. 

24. Construct comprising: 

15 (i) A nucleic acid capable of modifying expression of a nucleic acid encoding a 

2xC2H2 zinc finger protein and/or capable of modifying level and/or activity of a 
2xC2H2 zinc finger protein; 
(ii) One or more plant control sequence capable of driving expression of the 
nucleic acid sequence of (i); and optionally 

20 (iii) A transcription termination sequence. 

25. Construct according to claim 24, wherein said nucleic acid of (i) is a nucleic acid as defined 
in any of claims 14 to 17. 

25 26. Construct according to claim 24 or 25, wherein said control sequences of (ii) is at least a 
constitutive promoter, such as a GOS2 promoter. 

27. Construct according to claim 24 or 25, wherein said control sequences of (ii) is at least a 
tissue preferred promoter, such as seed-preferred promoter. 

30 

28. Host cell comprising a construct according to any of claims 24 to 27. 

29. Method for the production of a transgenic plant having increased yield, increased leaf 
surface area and/or prolonged vegetative growth, which method comprises 

35 (i) introducing into a plant or plant cell a 2xC2H2 zinc finger nucleic acid; 

(ii) Cultivating the plant or plant cell under conditions promoting plant 
growth. 
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30. Plant obtainable by a method according to any of claims 1 to 23 and 29, which plant has 
increased yield, modified leaf surface area and/or prolonged vegetative growth, relative to 
corresponding wild type plants. 

5 

31. Transgenic plant having increased yield, increased leaf surface area and/or prolonged 
vegetative growth, which transgenic plant has modified expression of a nucleic acid 
encoding a 2xC2H2 zinc finger protein and/or modified level and/or activity of a 2xC2H2 
zinc finger protein, relative to corresponding wild type plants. 

10 

32. Plant part, preferably a harvestable part, a propagule or progeny of a plant as defined in 
claim 30 or 31, which progeny has modified expression of a nucleic acid encoding 2xC2H2 
zinc finger protein and/or modified level and/or activity of a 2xC2H2 zinc finger protein, 
relative to corresponding wild type plants. 

15 

33. Plant or plant part according to any of claims 30 to 32, which plant is a monocotyledonous 
plant, preferably a cereal. 

34. Plant or plant part according to any of claims 30 to 33 selected from rice, maize, wheat, 
20 barley, millet, oats, rye, sorghum, soybean, sunflower, canola, sugarcane, alfalfa, 

leguminosae (bean, pea), flax , lupinus, rapeseed, tobacco, tomato, potato, squash, 
papaya, poplar and cotton. 

35. Use of a nucleic acid encoding a 2xC2H2 protein, of a 2xC2H2 protein and/or of a 
25 construct as defined in any of claims 24 to 27 to increase plant yield. 

36. A yield regulating composition comprising a nucleic acid encoding a 2xC2H2 protein, and/ 
or comprising a 2xC2H2 protein, and/or comprising a construct as defined in any one of 
claims 24 to 27. 

30 

37. Use of a nucleic acid encoding a 2xC2H2 protein, of a 2xC2H2 protein and/or of a 
construct as defined in any of claims 24 to 27 to increase leaf surface area. 

38. Use of a nucleic acid encoding a 2xC2H2 protein, of a 2xC2H2 protein and/or of a 
35 construct as defined in any of claims 24 to 27 to prolong vegetative growth. 
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39. Use of a nucleic acid encoding a 2xC2H2 protein, of a 2xC2H2 protein and/or of a 
construct as defined in any of claims 24 to 27 as target for an agrochemical. 

40. Use of a nucleic acid encoding a 2xC2H2 protein, of a 2xC2H2 protein and/or of a 
5 construct as defined in any of claims 24 to 27 in a breeding program. 

41 . Use of a plant as defined in any of claims 30 to 34 to produce enzymes, pharmaceuticals 
or agrochemicals. 

10 42. Use of a plant as defined in any one of claims 30 to 34 to produce food or feed products. 



15 
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constitutive promoter LB repeat nopallne 



FIGURE 1 
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SEQ ID NO 1: Arabidopsis thaliana STZ cDNA (CDS1536) 

AATGGCGCTCGAGGCTCTTACATCACCAAGATTAGCTTCTCCGATTCCTCCTTTGTTCGAAG 
ATTCTTCAGTCTTCCATGGAGTCGAGCACTGGACAAAGGGTAAGCGATCTAAGAGATCAAGA 
TCCGATTTCCACCACCAAAACCTCACTGAGGAAGAGTATCTAGCTTTTTGCCTCATGCTTCT 
CGCTCGCGACAACCGTCAGCCTCCTCCTCCTCCGGCGGTGGAGAAGTTGAGCTACAAGTGTA 
GCGTCTGCGACAAGACGTTCTCTTCTTACCAAGCTCTCGGTGGTCACAAGGCAAGCCACCGT 
AAGAACTTATCACAGACTCTCTCCGGCGGAGGAGATGATCATTCAACCTCGTCGGCGACAAC 
CACATCCGCCGTGACTACTGGAAGTGGGAAATCACACGTTTGCACCATCTGTAACAAGTCTT 
TTCCTTCCGGTCAAGCTCTCGGCGGACACAAGCGGTGCCACTACGAAGGAAACAACAACATC 
AACACTAGTAGCGTGTCCAACTCCGAAGGTGCGGGGTCCACTAGCCACGTTAGCAGTAGCCA 
CCGTGGGTTTGACCTCAACATCCCTCCGATCCCTGAATTCTCGATGGTCAACGGAGACGACG 
AAGTCATGAGCCCTATGCCGGCGAAGAAGCCTCGGTTTGACTTTCCGGTCAAACTTCAACTT 
TAAGGAAATT 

SEQ ID NO 2: Arabidopsis thaliana STZ protein with annotation 
of the domains 

B-BOX L-BOX 

male alts priias p ipplf edssvfhgvehwtkgkrskr5jr s dfhhqnlt eeeylafclmll 
ardnrqpppppaveklsyklcsvcdktfssyqalgghkash|rknlsqtlsgggddhstssatt 
tsavttgsgkshv |cticWsfpsgoalgghkrch| yegnnnintssvsnsegagstshvsssh 



RGFDiNIPPIPEFSMVNGDDEVMSPMPAKKPRFDFPVKLQL 
EAR motif 

SEQ ID NO 3: PRM3204 (sense, start codon in italics) 

5' GGGGACAAGTTTGTACAAAAAAGCAGGCTTCACAATGGCGCTCGAGGCTC 3' 

SEQ ID NO 4: PRM3205 (reverse, complementary stop codon in 
italics) 

5 ' GGGGACCACTTTGTACAAGAAAGCTGGGTAATTTCC TTJ1AAGTTGAAGTTTGA 3 ' 



BOXES 



SEQ ID NO 5: QALGGH motif 
QALGGH 

SEQ ID NO 6: NNM motif 
NN (M/W) QMH 

SEQ ID NO 7: EAR motif 
hDLNh (X)P 



SEQ ID NO 8: B-Box 
KR ( S ) KRXR 

SEQ ID NO 9: L-Box 
EXEXXAXCLXXL 



FIGURE 3 
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ORTHOLOGS OF STZ ±n OTHER PLANT SPECIES 

SEQ ID NO 10: Dg_AF119050_l Datisca glomerata zinc-finger 
protein 1 (zfpl) mRNA, complete cds 

GGCACGAGGACAAATTCTCTCTCTATCCTCTGAATATCTTTGGTTTGTGAACTGAGAAGCTA 
TTAGATGGCTCTAGAAGCGCTCAACTCTCCGACCACAGCTACGCCGGTGTTTCACTACGACG 
ACCCCAGCTTGAATTACCTTGAGCCATGGACCAAGCGTAAGCGTTCCAAGCGTACGCGCTTA 
GATAGCCCCATACCGAGGAAGAGTACCTTGCTTTCTGCCTCATCATGCTCGCTCGTGGCCGC 
GTTGCCTCTGCAAATCGACGGGATTCTCAGTCTTCCATTCAGATTCAGCCTGAAGCAACGAC 
TTCGGCTACCAAAGTCAGTTATAAGTGCTCTGTGTGCGATAAGGCCTTTTCGTCTTATCAGG 
CTTTGGGTGGGCACAAGGCCAGCCACAGAAAGCTCGCTGGCGGCGAAGATCAATCGACTTCC 
TTTGCCACCACGAATTCAGCCACCGTCACTACCACCACAGCCTCCGGAGGTGGTGGCAGGTC 
TCATGAGTGTTCTATTTGCCACAAATCGTTCCCGACTGGCCAGGCCTTGGGTGGTCACAAGC 
GCTGCCACTACGAAGGCAGTATCGGCGGCAATAGTATTCACCACCACAACAATACCACCAAC 
AGCGGAAGCAACGGTGGCATGAGCATGACCTCCGAAGTAGGTTCCACACACACAGTCAGCCA 
CAGTCACCGTGACTTCGATCTCAACATCCCGGCCTTGCCGGAGTTTCGGTCGAATTTCTTCA 
TATCCGGGGATGACGAGGTCGAGAGTCCTCATCCGGCCAAGAAACCCCGTATATTGATGAAA 
JEflAAAC AT T T C T C AAGATC AC TG AACC AG GCT T TAG T T T C T T TAT AGG AGG AG AT T T AAAAA 
AGTAGTATCTCTCTTTCTTTATCCGTAGGATAATTAATATATTTCGTGTACATAAATTTGTA 
GTTCTTTAACACACTCTGTTTCATTTTCTTGCTTTGCTCAACTTTGTATTGGTTATTTCATT 
ATGAAAATTCAATT 

SEQ ID NO 11: Dg_AF119050_l Datisca glomerata,, STZ ortholog, 
protein 

l^LEALNSPTTATPVFHYDDPSLNYLEPWTKRKRSKRTRLDSPHTEEEYLAFCLIMLARGRV 
ASANRRDSQSSIQIQPEATTSATKVSYKCSVCDKAFSSYQALGGHKASHRKLAGGEDQSTSF 
ATTNSATVTTTTASGGGGRSHECSICHKSFPTGQALGGHKRCHYEGSIGGNSIHHHNNTTNS 
GSNGGMSMTSEVGSTHTVSHSHRDFDLNIPALPEFRSNFFISGDDEVESPHPAKKPRILMK 

SEQ ID NO 12 : Gm_T09602_U687 63 . 1_GMU68763 Glycine max (soybean) 

probable zinc finger protein SCOF-1 mRNA, complete cds 

AAAATTCTCACTCTCTCTCTCATCTCGAGATCATAGTATCATATTCAATATCATTTCATACC 

AAACACATGGCTTTGGAAGCTCTCAACTCACCAACAACAACCGCTCCATCTTTTCCCTTTGA 

CGACCCAACTATTCCATGGGCGAAACGAAAACGTTCAAAGCGTTCTCGCGACCATCCTTCTG 

AAGAAGAGTACCTCGCCCTCTGCCTCATCATGCTCGCTCGCGGCGGCACCACCACCGTCAAC 

AACCGCCACGTCAGCCCTCCGCCGCTACAGCCACAGCCACAGCCGACACCAGATCCTTCCAC 

CAAGCTCAGTTACAAATGCTCCGTTTGCGACAAGAGCTTCCCCTCTTACCAAGCGCTCGGTG 

GACACAAGGCCAGTCACCGGAAACTCGCCGGCGCCGCCGAAGACCAACCCCCCAGCACCACC 

ACTTCCTCCGCCGCCGCCACCAGCTCCGCCTCCGGAGGTAAGGCCCATGAGTGCTCCATTTG 

CCACAAATCCTTCCCCACCGGACAGGCCCTTGGCGGACACAAACGTTGTCACTACGAAGGTA 

ACGGTAACGGAAATAACAACAACAGTAACAGCGTTGTCACCGTCGCCTCGGAAGGCGTGGGC 

TCCACCCACACTGTCAGTCACGGCCACCACCGCGACTTCGATCTCAACATCCCGGCCTTTCC 

GGATTTTTCGACCAAGGTCGGAGAAGACGAGGTTGAGAGCCCTCACCCTGTCATGAAGAAGC 

CTCGCCTCTTCGTCATTCCCAAGATCGAAATCCCCCAATTTCAATGAACTCGTTGAATTTTT 

AGT T TAT T T T TCG AC TAT AT AT T T T GG AG AAT TT TG AG AG T T AC T AT AAT T T GAT T T TGT AC 

ATAGTACTTGGAAGTTTTGTTGGACCGTACCGGACCCAGTTCTCTGGTTGAGGTTGTACTTT 

CACAACAGTGGCAGATTTGCAATTCAATTCAATTTATTTGTTTATTTTAAAAAAAAAAAAAA 

AAAA 

FIGURE 3 (continued) 
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SEQ ID NO 13: Gm_T09 602 Glycine max (soybean) probable zinc 
finger protein SCOF-1, STZ otholog, protein 
MALEALNSPTTTAPSFPFDDPTIPWAKRKRSKRSRra 

HVSPPPLQPQPQPTPDPSTKLSYKCSVCDKSFPSYQALGGHKASHRKLAGAAEDQPPSTTTS 
SAAATSSASGGKAHECSICHKSFPTGQALGGHKRCHYEGNGNGNNNNSNSVVTVASEGVGST 
HT VS HGHHRDFDLN I PAFPDF S TKVGE DE VE S PHPVMKKPRLFVI PKI E I PQFQ 

SEQ ID NO 14: Ms_CAB77055__ Y1878 8 . 1_MSY18788 Medicago sativa 
putative TFIIIA (or kruppel) -like zinc finger protein iaRNA 
AATTCGGCACGAGAAATAACCACTTCTCTCTCAAAACCTCCTTTTGCCTTTTGCTTCTACTT 
TCACTTGCGTAACGCTAACTAACTCTTCTCGAGTGTTCTTCTTTTCATCATATGGCTATGGA 

AGC AC T T AAC T C ACC C ACC AC TGCT AC TC C T T TC AC AC CC T T TG AG GAAC C AAAT C TG AG T T 
ATCTTGAAACACCGTGGACGAAAGGTAAACGATCAAAGCGTTCTCGCATGGATCAATCTTCA 
TGCACTGAAGAAGAGTATCTCGCTCTTTGTCTCATCATGCTTGCTCGCAGCGGTAACAACAA 
CGACAAAAAGTCTGATTCGGTGGCGACGCCGCTAACCACCGTTAAACTCAGTCACAAATGCT 
CAGTCTGCAACAAAGCTTTCTCATCTTATCAAGCCCTAGGTGGACACAAAGCCAGTCACCGG 
AAAGCTGTTATGTCCGCAACCACCGCTGAAGATCAGATCACCACCACTTCATCCGCCGTGAC 
T ACC AGC T CT G CT TC C AAC GG T AAG AACAAG AC T CATG AG TGTTCC AT CTGTC AC AAAT CCT 
TCCCTACTGGACAGGCTTTGGGAGGACACAAGCGTTGTCACTACGAAGGCAGCGTTGGTGCC 
GGTGCCGGTGCTGGAAGTAACGCTGTAACTGCCTCTGAAGGAGTTGGATTGTCACACAGCCA 
CCACCGTGATTTTGATCTTAACCTCCCGGCTTTTCCGGACTTTTCAAAGAAGTTTTTCGTGG 
ATGACGAGGTTTTTAGTCCTTTACCTGCTGCAAAGAAGCCCTGTCTTTTCAAGCTGGAAATT 
CCTTCTCATTACTGATCAATAATAGATCCAATTTTATTGTTATTATTATTAATAATTATTAT 
CGCTTAGGGCATAGTTATTTTCTTTTTTCTTTCAATTATTTCGGATCAATTTGTTCTGTACA 
T AC AAAT T GG GAT TGGT T T T AGAAT T T AGG AC GG TT GT AG AC AATGGAAAT TC AAT T C AAT T 
ATTTAATTTTGTGT 

SEQ ID NO 15: Ms_CAB77055_Medicago sativa putative TFIIIA (or 
kruppel) -like zinc finger protein, STZ otholog, protein 
MAMEALNSPTTATPFTPFEEPNLSYLETPWTKGKRSK 
GNNNDKKSDSVATPLTTVICLSHK^ 

SAVTTSSASNGKNKTHECSICHKSFPTGQALGGHKRCHYEGSVGAGAGAGSNAVTASEGVGL 
SHSHHRDFDLNLPAFPDFSKKFFVDDEVFSPLPAAKKPCLFKLEIPSHY 

SEQ ID NO 16: Nt_AAC06243_ AF053077 Nicotiana tabacum osmotic 

stress-induced zinc-finger protein (zfp) mRNA, complete cds 

T T TT CC CT CG AAT T T GATAAC TAAAGAG AAT AT TAT G AC T CT TG AAGC T T T GAAG TC AC CT A 

CG GC GGC AAC GCC GACT CT AC C ACC AC GC TA5PGAAGAT GATG AT GAAAT T CAT AAT T T GGAT 

TCTTGGGCTAAAGGAAAACGATCAAAACGGCCCCGTATTGATGCCCCACCGACTGAAGAAGA 

GTATT TAG CCCTCTGTCTCATCATGCTCGCTCGCAGCGGAACCG GAAC CAGAACCGGTTTAA 

CTGATGCTACTACTTCCCAACAACCTGCCGATAAAAAAACCGCCGAGTTGCCGCCGGTTCAT 

AAGAAAGAGG T GGCAAC AG AG C AAGC AGAG C AAT CT T AC AAG TG TAGCG TGTG TG AC AAGGC 

TTTTTCTTCTTATCAAGCACTCGGTGGGCATAAAGCAAGTCACCGTAAAACTACTACTACTG 

CTACCGCCGCCTCTGATGATAACAATCCTTCAACTTCAACTTCCACTGGCGCCGTTAATATC 

TCTGCTCTTAATCCAACTGGTCGTTCACACGTCTGTTCTATTTGCCACAAGGCTTTTCCTAC 

TGGCCAAGCTTTGGGTGGGCACAAGCGCCGCCACTATGAAGGCAAACTCGGTGGTAACAGCC 

GCGACTTAGGCGGCGGCGGCGGCGGCGGTCATAGTGGAAGCGTCTTGACTACTTCAGACGGC 
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GGCGCGTCGACTCACACGCTACGTGACTTTGACCTGAACATGCCTGCTTCGCCGGAATTGCA 
ACTGGGTCTGAGTATTGATTGTGGACGGAAAAGTCAACTGTTGCCGATGGTCCAAGAGGTGG 
AAAGTCCTATGCCTGCAAAGAAACCGCGTTTATTGTTTTCGTTGGGTTGAAACTTCTTTAGG 
GGAATTGAATTGATTGTGTTTTAGCCAAATTAGTAAATTGGTTCATGTGATTTTATTTTTAG 
GAAAAGGAATTATTGATTGTTTTACCCGTTTATTCTTAGGGTGGTATTATGTACAGGGAGTG 
AATC AT T CAT T GG T T T T AC AC T T T C T T AAT T ATATAT TCTTTTTTTT T AC AC AT AAAAAAAA 
AAAAAAA 

SEQ ID NO 17: Nt_AAC06243_Nicotiana tabacum osmotic stress- 
induced zinc-finger protein , STZ otholog, protein 

MTLEALKSPTAATPTLPPRYEDDDEIHNLDSWAKGKRSKRPRIDAPPTEEEYLALCLIMLAR 
SGTGTRTGLTDATTSQQPADKKTAELPPVHKKEVATEQAEQSYKCSVCDKAFSSYQALGGHK 
AS HRKT TTTAT AAS DDNN PS T S T S T GAVN I S ALNP TGRS HVC S I CHKAFPTGQALGGHKRRH 
YEGKLGGNSRDLGGGGGGGHSGSVLTTSDGGASTHTLRDFDLNMPASPELQLGLSIDCGRKS 

QLLPMVQEVE S PMPAKKPRLLFSLG 

SEQ ID NO 18: Os_AF332876 Oryza sativa zinc finger 
transcription factor ZF1 mRNA, complete cds 

AATTCGGCACGAGGCCACACAGCAACCAGCCAGCTGCCACACTAGCTTGAGGCGAGCGAGCG 
AAGCTTAGCTAGCGGATAGAACAAGTCGTCGATCTGCTTGCTGCTTTTGTGAATTGCGGTGG 
AAGCATGTCGAGCGCGTCGTCCATGGAAGCGCTCCACGCCGCGGTGCTCAAGGAGGAGCAGC 
AGC AGC AC GAG GT GG AGGAGGCG AC GG T C G T G AC GAGC AGC AGC GC C AC GAGC GGGG AGGAG 
GGCGGACACCTGCCCCAGGGGTGGGCGAAGCGGAAGCGGTCGCGCCGCCAGCGATCGGAGGA 
GGAGAACCTCGCGCTCTGCCTCCTCATGCTCGCCCGCGGCGGCCACCACCGCGTCCAGGCGC 
CGCCT'CCGCTCTCGGCTTCGGCGCCCCCGCCGGCAGGTGCGGAGTTCAAGTGCTCCGTCTGC 
GGCAAGTCCTTCAGCTCCTACCAGGCGCTCGGCGGCCACAAGACGAGCCACCGGGTCAAGCT 
GCCGACTCCGCCCGCAGCTCCCGTCTTGGCTCCCGCCCCCGTCGCCGCCTTGCTGCCTTCCG 
CCGAGGACCGCGAGCCAGCCACGTCATCCACCGCCGCGTCCTCCGACGGCATGACCAACAGA 
GTCCACAGGTGTTCCATCTGCCAGAAGGAGTTCCCCACCGGGCAGGCGCTCGGCGGGCACAA 
GAGGAAGCACTACGACGGTGGCGTAGGCGCCGGCGCCGGCGCATCTTCAACCGAGCTCCTGG 
CCACGGTGGCCGCCGAGTCCGAGGTGGGAAGCTCCGGCAACGGCCAGTCCGCCACCCGGGCG 
TTCGACCTCAACCTCCCGGCCGTGCCGGAGTTCGTGTGGCGGCCGTGCTCCAAGGGCAAGAA 
GATGTGGGACGAGGAGGAGGAGGTCCAGAGCCCCCTCGCCTTCAAGAAGCCCCGGCTTCTCA 
CCGCG rAA TTCAGCAGCTGCACGGATCCGATCCGTCAGAGTTTTTGTCTAGGGAGTGAAATT 
CAGTCGAAACACACTATTCGTTGATTCGTTTTGTGCCGCTATTGTTTAATTTGTTCCTGCTT 
TTGTACAGAGCAAGCGAGTGATACATAGCCATACATACAGTCATACAGATATAGGTCTAGCT 
CTTCCTTGGTTCTTTGTAACACTGGAACTGTACCTGTATCTTTTACACTTTGTTCTTTGACA 
GT CATATATTGTAGACCAAAAAAAAAAAAAAAAAA 

SEQ ID NO 19: Os_AF332 87 6 Oryza sativa seedling zinc finger 
transcription factor ZF1, STZ otholog, protein 

MS S AS SME ALHAAVLKEEQQQHE VEEAT WT S S SAT S GEEGGHLPQGWAKRKRSRRQRSEEE 
NLALCLIiMLARGGHHRVQAPPPLSASAPPPAGAEFKCSVCGKSFSSYQALGGHKTSHRVKLP 
T P PAAP VL AP AP VAALLP S AE DRE PAT S S TAAS S DGMTNRVHRC S I CQKE FPTGQ ALG GHKR 
KH YDGGVGAGAG AS S TELLAT VAAE SEVGS S GNGQ S ATRAFDLNLP AVPEFVWRPC SKGKKM 
WDEEEEVQSPLAFKKPRLLTA 
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SEQ ID NO 20: Ph_BAA0507 9_D2 6086 . 1 [Petunia x hybrida] . 
PETZFP4 zinc-finger protein gene • 

TTCACTCACCAAAACAACTTCTCTACCTCTTCTACTTGCACATTCAAATTCTTTCATTACTA 
CT T ATC TC T AC T AAT CT TG AT T C GAT T T T AGT AAAT C AAAC AAG AG AAT CT T T TC AGT AAT A 
CAAAC AAG AAAAT TTTCTCTC TAT ACT TG AT T GAGT T TAG T AAGGC AAAC AAG AAAACT AT C 
ATGGC ACT TG AAGC AT T GAAT TC TC C AAC T AC AAC AAC AC C ACC AT CAT TC CAAT TT GAGAA 
CAACGGGCTTAAGTACCTTGAGAGTTGGACAAAAGGTAAAAGATCAAAAAGGCAACGCAGCA 
TGGAACGACAGTGTACTGAAGAAGAGTATTTAGCACTTTGTCTTATCATGCTAGCACGTAGC 
GATGGTTCTGTTAATAACTCACGGTCTCTACCACCACCACCACTACCACCATCAGTTCCAGT 
AACGTCGCAAATAAACGCGACGTTATTGGAACAGAAGAATTTGTACAAGTGTTCCGTTTGTG 
GTAAAGGGTTTGGGTCTTATCAAGCTTTAGGTGGACATAAAGCAAGTCACCGGAAACTTGTC 
AGCATGGGAGGAGATGAACAATCTACTACTTCCACTACTACTAACGTAACGGGAACTAGTTC 
CGCTAACGTTAACGGTAACGGAAGAACTCACGAATGTTCAATTTGTCACAAGTGCTTTCCTA 
CTGGACAAGCTTTAGGTGGTCATAAAAGGTGCCACTATGACGGTGGTAACGGTAACGGTAAC 
GGAAGTGTAAGTGTTGGGGTGACGTCATCTGAAGGTGTGGGGTCCACTATTAGTCATCACCG 
TGACTTTGACTTGAATATTCCCGCGTTGCCGGAGTTTTGGCCGGGATTTGGTTCCGGCGAGG 
AT GAGGT GG AG AG T C C T C ATC C AGC AAAG AAG T C AAGG CT AT CT C T TC C AC C T AAAC T T GAA 
T TAT TC AAAGGAT TATAGAGG GAAT AT T GAT T TG T T AC AGGAAG AT T TAT TAG GAT T C ACG A 
AT TTTTTGTT GAC T AGT T TAT GT AAT AT 

SEQ ID NO 21: Ph_BAA05079 [Petunia x hybrida] zinc-finger 

protein, STZ otholog, protein 

MALEALNSPTTTTPPSFQFENNGLKYL^ 

DGSVNNSRSLPPPPLPPSVPVTSQINATLLEQKNLYKCSVCGKGFGSYQALGGHKASHRKLV 
SMGGDEQSTTSTTTWTGTSSANVNGNGRTHECSICHKCFPTGQALGGHKRCHYDGGNGNGN 
GSVSVGVTSSEGVGSTISHHRDFDLNIPALPEFWPGFGSGEDEVESPHPAKKSRLSLPPKLE 
LFKGL 

SEQ ID NO 22: Ta__BAA03901 Triticum aestivum gene for zinc- 
finger protein WZF1, complete cds 

ATGTCGTCGTCGGCCATGGAAGCGCTCCACGCCCTGATCCCGGAGCAGCACCAGCTGGACGT 
TGAGGCGGCTGCGGCTGTCAGCAGCGCCACCAGCGGCGAGGAGAGCGGCCACGTGCTGCAGG 
GGTGGGCCAAGAGGAAGCGATCGCGCCGCCAGCGCTCCGAGGAGGAGAACCTCGCGCTCTGC 
CTCCTCATGCTCTCGCGCGGCGGCAAGCAGCGTGTTCAGGCGCCGCAGCCGGAGTCGTTCGC 
TGCGCCGGTGCCTGCCGAGTTCAAGTGCTCCGTCTGCGGCAAGTCCTTCAGCTCCTACCAGG 
CGCTCGGAGGCCACAAGACGAGCCACCGGGTGAAGCAGCCGTCTCCTCCCTCTGATGCCGCT 
GCTGCCCCACTCGTGGCCCTCCCGGCCGTCGCCGCCATCCTGCCGTCCGCCGAGCCGGCCAC 
GTCGTCCACCGCCGCGTCCTCCGACGGCGCGACCAACAGAGTCCACAGGTGCTCCATCTGCC 
AAAAGGAGTTCCCGACTGGGCAGGCGCTCGGCGGGCACAAGAGGAAGCACTACGACGGAGGC 
GTGGGCGCCGCCGCCTCGTCGACCGAGCTTCTGGCCGCCGCGGCCGCCGAGTCTGAGGTGGG 
GAGCACCGGCAACGGGAGCTCCGCCGCCCGGGCCTTCGACCTGAACATTCCGGCCGTGCCGG 
AGTTCGTGTGGAGGCCGTGCGCCAAGGGCAAGATGATGTGGGAGGACGATGAGGAGGTGCAG 
AGCCCCCTCGCCTTCAAGAAGCCTCGGCTTCTCACCGCTTGA 
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SEQ ID NO 23: Ta__BAA03901_WZFl Triticum aestivum, STZ otholog, 
protein 

MSSSAMEALHALIPEQHQLDVEAAAAVSSATSGEESGHVLQGWAKRKRSRRQRSEEENIiALC 

LLMLSRGGKQRVQAPQPESFAAPVPAEFKCSVCGKSFSSYQALGGHKTSHRVKQPSPPSDAA 

AAPLVALPAVAAI LPS AE PAT S S TAAS S DGATNRVHRCS I CQKEFPTGQALGGHKRKHYDGG 

VGAAASSTELLAAAAAESEVGSTGNGSSAARAFDLNIPAVP 

SPLAFKKPRLLTA 

SEQ ID NO 24: Ca AF539746 Capsicum annum zinc finger protein 
mRNA, complete cds 

AAAATC T T CG C TACT T AC T T ACATC TT C T AGAAT AG TC AC T AGAAC C AG T AAC T T T AT AC AA 
CGGATATCGAT ATO GCACTTGAAGCTTTGAATTCTCCAACTGGTACACCAACTCCGCCACCG 
T T TC AAT T T GAGAGC GACGGC C AAC AGC T T CG AT AT AT CG AAAAC T GG AGG AAGGGAAAGAG 
ATCTAAAAGGTCACGCAGCATGGAGCACCAGCCTACTGAGGAAGAATACTTAGCGCTTTGTT 
TGATCATGCTTGCACGTAGCGGTGGCTCCGTTAATCATCAACGATCTCTACCACCGCCGGCT 
CCGGTGATGAAACTGCACGCGCCGTCGTCATCATCGGCGGCGGAGGAGGAGAAGGAGAAGAT 
GGTGTATAAGTGTTCGGTTTGTGGTAAGGGATTTGGGTCTTATCAAGCTTTAGGTGGACACA 
AAGCTAGTCACCGGAAACTCGTACCCGGCGGAGATGATCAGTCAACTACCTCCACAACCACT 
AACG C AAC C GG AAC AAC AACC TC CGT T AAC GGCAAC GGC AAC AGAAGT GGAAGGAC T CACG A 
GTGTTCGATTTGTCACAAGTGTTTTCCCACTGGACAAGCTTTAGGTGGACACAAAAGGTGTC 
ACTACGACGGCGGTATCGGTAACGGAAACGCTAACAGTGGCGTTAGTGCTAGCGTTGGAGTG 
ACGTCATCGGAGGGTGTGGGGTCCACAGTCAGTCACCGGGATTTCGACTTGAACATTCCGGC 
GTTGCCGGAATTCTGGCTGGGATTTGGTTCCGGCGAAGATGAGGTGGAGAGTCCACATCCGG 
CGAAGAAATCGCGGTTATGTTTGCCTCCAAAATATGAATTATTTCAACATMATGGGAATTT 

GAT T GT T AGG ATT TACT AT T T T GGT AG AC AAAAT TAT ACT AT GT AAGT T T T AAT T T T CAT T G 
TGGGTGGGAGCAAAATTTTTAATTTTTTGTCTATAGACCTAGCTAGTTACTAATAGCAAAAA 
T T C AAT TG AT T GAT T TAAAAAAAAAAAAAAAAAA 

SEQ ID NO 25: Ca AF539746-capsicum annum, STZ otholog, protein 
mLEALNSPTGTPTPPPFQFESDGQQLRYIElvMRKGKRSKRSRSMEHQPTEEEYIiALCLI^ 
ARSGGSVNHQRSLPPPAPVMKLHAPSSSSAAEEEKEKMVYKCSVCGKGFGSYQALGGHKASH 
RKLVPGGDDQSTTSTTTNATGTTTSVNGNGNRSGRTHECSICHKCFPTGQALGGHKRCHYDG 
GIGNGNANSGVSASVGVTSSEGVGSTVSHRDFDLNIPALPEFWLGFGSGEDEVESPHPAKKS 

RLCLPPKYELFQH 

PARAIOGS OF STZ IN ARABIDOPSIS THALIANA 

SEQ ID NO 26: gi_18402298_ref_NM_112848 . 1 mRNA 

ACTTCACTCTCTAATTTCCTTCTCTCTATCTCTCACCATATTCGCGATTAAAAACTCTCAAC 
T T TT CT CT C AAAT T TC TG AT C CT TT G ATC C AAC AG T T AG AAG AAGAT TC AT C T GAT C ATGGC 
CCTCGAAGCGATGAACACTCCAACTTCTTCTTTCACCAGAATCGAAACGAAAGAAGATTTGA 
TGAACGACGCCGTTTTCATTGAGCCGTGGCTTAAACGCAAACGCTCCAAACGTCAGCGTTCT 
CACAGCCCTTCTTCGTCTTCTTCCTCACCGCCTCGATCTCGACCCAAATCCCAGAATCAAGA 
TCTTACGGAAGAAGAGTATCTCGCTCTTTGTCTCCTCATGCTCGCTAAAGATCAACCGTCGC 
AAACGCGATTTCATCAACAGTCGCAATCGTTAACGCCGCCGCCAGAATCAAAGAACCTTCCG 
TACAAGTGTAACGTCTGTGAAAAAGCGTTTCCTTCCTATCAGGCTTTAGGCGGTCACAAAGC 
AAGTCACCGAATCAAACCACCAACCGTAATCTCAACAACCGCCGATGATTCAACAGCTCCGA 
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CCATCTCCATCGTCGCCGGAGAAAAACATCCGATTGCTGCCTCCGGAAAGATCCACGAGTGT 
TCAATCTGTCATAAAGTGTTTCCGACGGGTCAAGCTTTAGGCGGTCACAAACGTTGTCACTA 
CGAAGGCAACCTCGGCGGCGGAGGAGGAGGAGGAAGCAAATCAATCAGTCACAGTGGAAGCG 
TGTCGAGCACGGTATCGGAAGAAAGGAGCCACCGTGGATTCATCGATCTAAACCTACCGGCG 

T T AC C T GAAC T C AGCC T T CAT C AC AAT CC AATC GT CG AC GAAG AGATC T T GAGT CC GT T G AC 
CGGTAAAAAACCGCTTTTGTTGACCGATCACGACCAAGTCATCAAGAAAGAAGATTTATCTT 
TAAAAATCTAATACTCGACTATTAATTCTTGTGTGATTTTTTTCGTTACAACCATAGTTTCA 
TTTTCATTTTTTTAGTTACAAATTTTTAATTGTTCTGATTTGGATTGAATATTGGTATATTG 

TTAGGGGTTGATAC 

SEQ ID NO 27: Translation of gi__18402298_ref _NM_112848 . 1_ 
MALEAMNTPTSSFTRIETKEDLMNDAVFIEPW 

QDLTEEEYLALCLLMLAKDQPSQTRFHQQSQSLTPPPESKNLPYKCNVCEKAFPSYQALGGH 
KASHRIKPPTVISTTADDSTAPTISIVAGEKHPIAASGKIHECSICHKVFPTGQAIiGGHKRC 
HYEGNLGGGGGGGSKSISHSGSVSSTVSEERSHRGFIDLNLPALPELSLHHNPIVDEEILSP 
LT GKKPLLLT DH DQ VI KKE D L S LKI 

SEQ ID NO 28: gi_30 680473_ref_NM_120516 . 3_mRNA 

AAATCAAATCTTTTCATTTACAATTATCTTTCTTCTCAATTTAGAACTTAGTAGCTAGTCTT 
CAAGATAATGGCACTTGAAACTCTTACTTCTCCAAGATTATCTTCTCCGATGCCGACTCTGT 
TTCAAGATTCAGCACTAGGGTTTCATGGAAGCAAAGGCAAACGATCTAAGCGATCAAGATCT 
G AAT TC GACC GT C AGAGT C T C ACGG AG GAT G AAT AT AT C G C T T T AT GT CT C AT GCT TC T T GC 
TCGCGACGGAGATAGAAACCGTGACCTTGACCTGCCTTCTTCTTCGTCTTCACCTCCTCTGC 
TTCCTCCTCTTCCTACTCCGATCTACAAGTGTAGCGTCTGTGACAAGGCGTTTTCGTCTTAC 
CAGGCTCTTGGTGGACACAAGGCAAGTCACCGGAAAAGCTTTTCGCTTACTCAATCTGCCGG 
AGGAGATGAGCTGTCGACATCGTCGGCGATAACCACGTCTGGTATATCCGGTGGCGGGGGAG 
GAAGTGTGAAGTCGCACGTTTGCTCTATCTGTCATAAATCGTTCGCCACCGGTCAAGCTCTC 
GGCGGCCACAAACGGTGCCACTACGAAGGAAAGAACGGAGGCGGTGTGAGTAGTAGCGTGTC 
GAATTCTGAAGATGTGGGGTCTACAAGCCACGTCAGCAGTGGCCACCGTGGGTTTGACCTCA 
ACATACCGCCGATACCGGAATTCTCGATGGTCAACGGAGACGAAGAGGTGATGAGTCCTATG 
CCGGCGAAGAAACTCCGGTTTGACTTCCCGGAGAAACCCTAAACATAAACCTAGGAAAAACT 
T T AC AG AAT T C ATT T T AT AGG AAAT TGT T T T AC T GT ATAT AC AAAT AT CG AT T T TG AT T GAT 
GTTCTTCTTCACTGAAAAATTATGATTCTTTGTTGTATAATTGATGTTTCTGAAAAAGATAT 
AACTTTTTATTGTTTCACACGTATCAAAATTTGCTTGGATACATCA 

SEQ ID NO 29: Translation of gi_30680473_ref JSIM_120516 . 3_ 
MALETLTSPRLSSPMPTLFQDSALGFHGSKGKRSKRSRSEFDRQSLTEDEYIALCLMLLARD 
GDRNRDLDLPSSSSSPPLLPPLPTPIYKCSVCDKAFSSYQALGGHKASHRKSFSLTQSAGGD 
ELSTSSAITTSGISGGGGGSVKSHVCSICHKSFATGQALGGHKRCHYEGKNGGGVSSSVSNS 
EDVGST SHVS SGHRGFDLNI PPI PEFSMVNGDEEVMS PMPAKKLRFDFPEKP 
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SEQ ID NO 30: gi_30 693252_ref_NM__114853 . 2_mRNA 

ATGGCTCTCGACACTCTCAATTCTCCCACCTCCACCACCACAACCACCGCTCCTCCTCCTTT 
CCTCCGTTGCCTCGACGAAACCGAGCCCGAAAACCTCGAATCATGGACCAAAAGAAAACGTA 
CAAAACGTCACCGTATAGATCAACCAAACCCTCCTCCTTCTGAAGAAGAGTATCTCGCTCTT 
TGCCTCCTTATGCTCGCTCGTGGCTCCTCCGATCATCACTCTCCACCGTCGGATCATCACTC 
TCTTTCTCCACTGTCCGATCATCAGAAAGATTACAAGTGTTCCGTCTGTGGCAAATCTTTCC 
CGTCTTACCAAGCGTTAGGTGGACACAAAACAAGTCACCGGAAACCGGTTAGTGTCGATGTT 

AAT AAT AG T AAC GG AACC GT T AC T AAT AACGGAAAT AT T AGT AACGGT T TAG T T GG T C AAAG 
TGGGAAGACTCATAACTGCTCTATATGTTTTAAGTCGTTTCCCTCTGGTCAAGCATTGGGTG 
GTCACAAACGTTGTCACTATGATGGTGGTAACGGTAACAGTAACGGTGACAATAGCCACAAG 
TTTGACCTAAATTTACCGGCTGATCAAGTTAGTGATGAGACAATTGGAAAAAGTCAACTCTC 
CGGTGAAGAAACAAAGTCGGTGTTGTGATTATTATTATTTTTTACCGATCGGGATTAGCTAG 
TGGT T GAT CAT TAG C T G AGT C TG T AAT GAAAAT GAT 

SEQ ID NO 31: Translation of giJ30693252_ref JNMJL14853 . 2 
MALDTLNSPTSTTTTTAPPPFLRCLDETEPENLESWTKRKRTKRHRIDQPNPPPSEEEYLAL 
CLLMIiARGSSDHHSPPSDHHSLSPLSDHQKDYKCSVCGKSFPSYQALGGHKTSHRKPVSVDV 
NNSNGTVTNNGNISNGLVGQSGKTHNCSICFKSFPSGQALGGHKRCHYDGGNGNSNGDNSHK 

FDLNLPADQVS DE T I GKSQLS GEE TKS VL 

SEQ ID NO 32: gi_30694224_ref JSIM_123683 . 2_mRNA 

AAATTTTCTATAGCAATGGCGCTTGAAGCTCTTAATTCACCAAGATTGGTCGAGGATCCCTT 

AAGAT T C AAT GGCG T T GAGC AGT GG AC C AAAT G T AAG AAACG AT CC AAAC GT TC G AGATC T G 
ATCTTCATCATAACCACCGTCTCACTGAGGAAGAGTATCTAGCTTTCTGTCTCATGCTTCTT 
GCTCGGGATGGCGGCGATCTTGACTCTGTGACGGTTGCGGAGAAGCCGAGTTATAAGTGTGG 
CGTTTGTTACAAGACGTTTTCGTCTTACCAAGCTCTCGGCGGTCATAAAGCGAGCCACCGGA 
GCTTATACGGTGGTGGAGAGAATGATAAATCGACACCATCCACCGCCGTGAAATCTCACGTT 
TGTTCGGTTTGCGGGAAATCTTTCGCCACCGGTCAAGCTCTCGGCGGCCACAAGCGGTGCCA 
CTACGATGGTGGCGTTTCGAACTCGGAAGGTGTGGGGTCTACTAGCCACGTCAGCAGTAGTA 

GC C ACC GT GG AT T T GACC TT AAT AT T AT ACC GG TGCAGG GAT T T T C GC CG GACG AC GAAG T G 
ATGAGTCCGATGGCGACTAAGAAGCCTCGCCTGAAGTAAGTCTTTGTTGAAGACCTGGAAGT 
TTATCAAATGTAAATATCAAATTTCAATTTCAAGGAACAGTTTTGTTGATTCTATTACCAAT 

AC AC AAT ACGAT TC AAT T CC 

SEQ ID NO 33: Translation of giJ30694224_ref JflM_123683 . 2_ 
MALE ALNS PRLVEDPLRFNG VEQWTKCKKRSKRSRS DLHHNHRLTEEE YL AFCLMLLARDGG 
DLDSVTVAEKPSYKCGVCYKTFSSYQALGGHKASHRSLYGGGENDKSTPSTAVKSHVCSVCG 
KSFATGQALGGHKRCHYDGGVSNSEGVGSTSHVSSSSHRGFDLNIIPVQGFSPDDEVMSPMA 

TKKPRLK 
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SEQ ID NO 34: gi_306 98307_ref JSTM_126145 .2_mRNA 

CACACTTCACTCTTTCTTCATCTTCTTCTTCTTAAATAGCTCGAAATCACATCTCACAGAAT 
TAAATCTTATGGCTCTCGAGACTCTCAATTCTCCAACAGCTACCACCACCGCTCGGCCTCTT 
CTCCGGTATCGTGAAGAAATGGAGCCTGAGAATCTCGAGCAATGGGCTAAAAGAAAACGAAC 
AAAACGTCAACGTTTTGATCACGGTCATCAGAATCAAGAAACGAACAAGAACCTTCCTTCTG 
AAGAAGAGTATCTCGCTCTTTGTCTCCTCATGCTCGCTCGTGGCTCCGCCGTACAATCTCCT 
CCTCTTCCTCCTCTACCGTCACGTGCGTCACCGTCCGATCACCGAGATTACAAGTGTACGGT 
CTGTGGGAAGTCCTTTTCGTCATACCAAGCCTTAGGTGGACACAAGACGAGTCACCGGAAAC 
CGACGAACACTAGTATCACTTCCGGTAACCAAGAACTGTCTAATAACAGTCACAGTAACAGC 
GGTTCCGTTGTTATTAACGTTACCGTGAACACTGGTAACGGTGTTAGTCAAAGCGGAAAGAT 
TCACACTTGCTCAATCTGTTTCAAGTCGTTTGCGTCTGGTCAAGCCTTAGGTGGACACAAAC 
GGTGTCACTATGACGGTGGCAACAACGGTAACGGTAACGGAAGTAGCAGCAACAGCGTAGAA 
CTCGTCGCTGGTAGTGACGTCAGCGATGTTGATAATGAGAGATGGTCCGAAGAAAGTGCGAT 
CGGTGGCCACCGTGGATTTGACCTAAACTTACCGGCTGATCAAGTCTCAGTGACGACTTCTT 
AACGTTGACTGAGTTTGAGGAAAAAGTCAACTATCAAGCGAAGAAAGGGTTAGTGGACGGTG 
AAGATTAACGGTCGTTTCTTTCCAGTTGCTTCGGTTTGAGCTTGACTGGGTCTGTAATGAAA 
ATGATTGGAGTGGACTTGGCATTATTATTATTATTTTTAAAAAGAAATGTTAATTTGTTGTT 
GG AT T T GT T TAT AG AT AG AG G AAAC AATT GGG AT AC AC AAAT AT TTTTTTTTTT T AC AAAGA 
AAAT AATAAT GC AG AG AT GG AT GAT TGGATC GT AC AC GT TAT T AT AT AGT GG AC CAT T CT GT 
AATCGT GAAT TAT TAT TAT T T GT TAG AAAT T T AAT T T TC G T 

SEQ ID NO 35: Translation of gi_30698307_ref_NM_126145 . 2_ 
MALE TLNS PT AT T T ARPLLRYREEME PENLE QWAKRKRTKRQRF DHGHQNQE TNKNLP SEEE 
YLALCLIJyiIARGSAVQSPPLPPLPSRASPSDHRDYKCTVCGKSFSSYQALGGHKTSHRKPTN 
TSITSGNQELSNNSHSNSGSWI3WTVNTGNGVSQSGKIHTCSICFKSFASGQALGGHKRCH 
YDGGNNGNGNGSSSNSVELVAGSDVSDVDNERWSEESAIGGHRGFDLNLPADQVSVTTS 

OTHER GENES IN EVALUATION 

SEQ ID NO 36: gi_ 12698881_ref_ AF_33287 6.1 _mRNA, 2xC2H2, 
Oryza sativa 

AATTCGGCACGAGGCCACACAGCAACCAGCCAGCTGCCACACTAGCTTGAGGCGAGCGAGCG 
AAGCTTAGCTAGCGGATAGAACAAGTCGTCGATCTGCTTGCTGCTTTTGTGAATTGCGGTGG 
AAGCATGTCGAGCGCGTCGTCCATGGAAGCGCTCCACGCCGCGGTGCTCAAGGAGGAGCAGC 
AGCAGCACGAGGTGGAGGAGGCGACGGTCGTGACGAGCAGCAGCGCCACGAGCGGGGAGGAG 
GGCGGACACCTGCCCCAGGGGTGGGCGAAGCGGAAGCGGTCGCGCCGCCAGCGATCGGAGGA 
GGAGAACCTCGCGCTCTGCCTCCTCATGCTCGCCCGCGGCGGCCACCACCGCGTCCAGGCGC 
CGCCTCCGCTCTCGGCTTCGGCGCCCCCGCCGGCAGGTGCGGAGTTCAAGTGCTCCGTCTGC 
GGCAAGTCCTTCAGCTCCTACCAGGCGCTCGGCGGCCACAAGACGAGCCACCGGGTCAAGCT 
GCCGACTCCGCCCGCAGCTCCCGTCTTGGCTCCCGCCCCCGTCGCCGCCTTGCTGCCTTCCG 
CCGAGGACCGCGAGCCAGCCACGTCATCCACCGCCGCGTCCTCCGACGGCATGACCAACAGA 
GTCCACAGGTGTTCCATCTGCCAGAAGGAGTTCCCCACCGGGCAGGCGCTCGGCGGGCACAA 
GAGGAAGCACTACGACGGTGGCGTAGGCGCCGGCGCCGGCGCATCTTCAACCGAGCTCCTGG 
CCACGGTGGCCGCCGAGTCCGAGGTGGGAAGCTCCGGCAACGGCCAGTCCGCCACCCGGGCG 
TTCGACCTCAACCTCCCGGCCGTGCCGGAGTTCGTGTGGCGGCCGTGCTCCAAGGGCAAGAA 
GATGTGGGACGAGGAGGAGGAGGTCCAGAGCCCCCTCGCCTTCAAGAAGCCCCGGCTTCTCA 
CCGCGTAATTCAGCAGCTGCACGGATCCGATCCGTCAGAGTTTTTGTCTAGGGAGTGAAATT 
CAGTCGAAACACACTATTCGTTGATTCGTTTTGTGCCGCTATTGTTTAATTTGTTCCTGCTT 
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TTGTACAGAGCAAGCGAGTGATACATAGCCATACATACAGTCATACAGATATAGGTCTAGCT 
CTTCCTTGGTTCTTTGTAACACTGGAACTGTACCTGTATCTTTTACACTTTGTTCTTTGACA 
GT CAT AT AT T GT AG AC C AAAAAAAAAAAAAAAAAA 

SEQ ID NO 37: gi_12 6988 82_ref_AAK0 1713 . 1 , 2xC2H2, Oryza sativa 
MS S AS SME ALHAAVLKEE QQQHE VEEAT WT S S SATS GEEGGHL PQGWAKRKRSRRQRSEEE 
NLALCLLMIiARGGHHRVQAPPPLSASAPPPAGAEFKCSVCGKSFSSYQALGGHKTSHRVKLP 
TPPAAPVL APAP VAALLPS AE DREP AT S S TAAS S DGMTNRVHRC S I CQKE FP TGQALGGHKR 
KH YDGGVGAGAG AS S TELLAT VAAE SE VGS S GNGQ S ATRAFDLNLP AVPE FVWRPC SKGKKM 
WDEEEEVQSPLAFKKPRLLTA 

SEQ ID NO 38: gi_643421 5_r ef _AL1 32966. l_regi on 116202 
116729, 2xC2H2 f Arabidopsis thaliana 

AT G AAG AG AG AC CG GT CC GAT T ACG AAGAAT CC AT GAAGC AT AT AG AC AT AG T AGAAAGT CT 
AATGATGTTATCTCGAAGTTTCGTGGTCAAACAAATCGATGTAAAGCAATCTACCGGAAGCA 
AAACGAACCATAATAACCACTTCGAATGCAAAACGTGTAACCGGAAATTTGATTCCTTCCAA 
GCTCTTGGAGGTCATAGAGCTAGCCACAAGAAACCTAAGCTGATCGTTGACCAAGAACAGGT 
GAAGC AT CGTAAC AAAGAGAAT GATATGCATAAGTGT ACAATT TGCGAT CAAAT GT TTGGGA 
C CGG TC AAGC TC T AGG CGGT C AC AT GAG AAAGC AT AG GAC GAGC AT GATAAC CG AG CAAT CG 
ATTGTCCCTTCTGTGGTTTATTCCAGACCGGTTTTTAATCGTTGCAGTAGCAGCAAGGAGAT 
CTTGGACTTAAATCTAACTCCATTGGAAAATGATCTTGTGTTAATCTTTGGGAAGAATTTGG 
TTCCACAAATTGATTTGAAGTTTGTGAATTAG 

SEQ ID NO 39: gi_6729511_ref_CAB67 667 . 1, 2xC2H2, Arabidopsis 
thaliana 

MKRDRSDYEESMKHIDIVESLMMLSRSFVVKQIDVKQSTGSKTNHNNHFECKTCNRKFDSFQ 
ALGGHRAS HKKPKL I VDQEQVKHRNKENDMHKC T I CDQMFGT GQ ALGGHMRKHRT SMI TEQS 
IVPSWYSRPVFNRCSSSKEILDLNLTPLENDLVLIFGKNLVPQIDLKFVN 

SEQ ID NO 40: ref_CA279020 , 2xC2H2 f sugar cane 

CCTAACCAGCATTAGCTTTTCAAATCAACAAGCCTCGCCGTGACCGATCGATGGCCATCACC 
CACGACGACTACGTCTCCCTCTGCCTCATGGCGCTCGCAGCCGCGGGAGGCGGAGGCCAAGC 
TGGTTTAACAACGCAGTACGCTCTGAACACGGCTGCCTGGACAGCGACGGCGCAAGAGTCCG 
AGCTCCGCTTCCGGTGCTCCGTCTGTGGCAAGGCCTTCGCGTCGCACCAGGCACTGGGCGGG 
CACAAGGCCAGCCACCGCAAGCCGACGCTCGTACAGGCACATGCGTCGTCCTCAGCCGGAGG 
CGCGGCGTCGTCGTCGGTAACAATGACCTCGGCCGTAGGCAGCAGTGGGCAGGGGAGGCACA 
GGTGCACGGTGTGCCATCGGAGCTTCGCGACGGNGCAAGCGCTCGGCGGGCACAAGAGGTGC 
CATTACTGGGACGGGCTCTCGGTGTCGCTCACCGCGTCGTCGGCGCCATCGGGGTCCGGGTC 
GACCGTCAAGGGCTTTGATCTGAATTTGGTGCCGGTGCCGCCCGCGATGGCCGCCAACGCTG 
CGACAAGGTGGGGAGAGGAGAANNAAGTCANAAACCCTTGGCGGTCAAGAGAAGGCGGCTTG 
CCGGTCCGTCTTGGACCCTAATTTAACGATTTAGAAGTCCTTTTTTTAATAATTAAGAGTTC 
TTTTGAAGAAGGTTGTAAAGTTTTCGAACCTTGTTCTTTTAATGGATTTGGGTGCTGGCGAA 
ATTTTAAAACTGGATTTAAATTTGCGCTCACTCTTTTTTTTTATTTTTTACACCCTTTTTTT 

T T T T TAG AAG AAG A 
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SEQ ID NO 41: gi_18027011_ref_AF254447 . 1, 2xC2H2, Arabidopsis 
thaliana 

TTCCTTTCTCTTCCTCTCTCTCTCTCTTCACCATGACTGATCCTTATTCCAATTTCTTCACA 
GACTGGTTCAAGTCTAATCCTTTTCACCATTACCCTAATTCCTCCACTAACCCCTCTCCTCA 
TCCTCTTCCTCCTGTTACTCCTCCCTCTTCCTTCTTCTTCTTCCCTCAATCCGGAGACCTCC 
GCCGTCCACCGCCGCCACCAACTCCTCCTCCTTCTCCTCCTCTCCGAGAAGCCCTCCCTCTC 
CT C AGC C T C AGCCC C GCC AAC AAAC AAC AAG AC C ACC AT C AC AACC AT GACC AC CT TAT T C A 
AG AACC AC CT T C AACC T C CAT GG AT GT CG AC T AC GAT CAT C ACC AT C AAG AT GATC AT CAT A 
ACCTCGATGACGATGACCATGACGTCACCGTTGCTCTTCACATAGGCCTTCCAAGCCCTAGT 
GCTCAAGAGATGGCCTCTTTGCTCATGATGTCTTCTTCTTCCTCTTCCTCGAGGACCACTCA 

TC AT C ACG AGGAC ATG AAT C AC AAGAAAG AC CT CG AC CAT G AGT AC AG CC AC GG AGC T GT CG 
GAGGAGGAGAAGATGACGATGAAGATTCAGTCGGCGGAGACGGCGGCTGTAGAATCAGCAGA 
CTCAACAAGGGTCAATATTGGATCCCTACACCTTCTCAGATTCTCATTGGCCCTACTCAGTT 
CTCATGTCCTGTTTGCTTCAAAACCTTCAACAGATACAATAACATGCAGATGCATATGTGGG 
GACATGGAT C AC AAT AC AGAAAAGG AC C T GAAT C T C T AAG GGGAAC AC AACC AAC AGG AATG 
CTAAGGCTTCCGTGCTATTGCTGCGCCCCAGGCTGTCGCAACAACATTGACCATCCAAGGGC 
AAAGCC TCTCAAAG AC TT CAGAACCCT TC AAAC AC AT TAC AAGAGAAAACATGGGATC AAAC 
CTTTCATGTGTAGGAAATGTGGAAAGGCTTTCGCAGTCCGAGGGGACTGGAGAACACATGAG 
AAGAATTGTGGCAAACTTTGGTATTGCATATGTGGATCTGATTTCAAGCACAAGAGATCTCT 
CAAAGATCACATCAAGGCTTTTGGGAATGGTCATGGAGCCTACGGAATTGATGGGTTTGATG 
AAGAAG AT GAGC CTGCCTCT G AGGT AG AAC AAT T AGAC AAT GAT C ATG AG TC AATGC AGT C T 
AAAT AGC T TAT AT AT AT TAC T AT AAGT AC T AAG T AAT TC GGT AT AT AT AT T AAT TAT AAG AA 
ACCTAAAT CT ATGGACCAAGT TT TGAT GGAGGTAGGGCT T T TCAAACT AAAAGC TAT AT CAT 
C T AAT T GAT C AT AGGAAAAAAAT GAAT C AAG AGC AC T TGGAAAAT T T T AAAT TG T ATC T T T A 
GCTTCCTAGTTAAATTTATTGCAAGACAATGTAGCAGTCTAACCAATGAGGTTCCCAACGGT 
TTATTTCTATTTGTATATTATTTTGTCATTAGCTTCACCTTTCGTTAATTCGAAGGACATAA 
CTTATAAATGTTTAAATTATG 

SEQ ID NO 42: At3g57670, 2xC2H2, Arabidopsis thaliana 

MTDPYSNFFTDWFKSNPFHHYPNSSTNPSPHPLPPVTPPSSFFFFPQSGDLRRPPPPPTPPP 

SPPLREALPLLSLSPAJSIKQQDHHHNHDHLIQEPPSTSMDVDYDHHHQDDHHNLDDDDHDVTV 

ALHIGLPSPSAQEmSLI^SSSSSSSRTTHHHEDl^HKKDLDHEYSHGAVGGGEDDDEDSV 

GGDGGCRISRLNKGQYWIPTPSQILIGPTQFSCPVCFKTFNRYNmQMHMWGHGSQYRKGPE 

SLRGTQPTGMLRLPCYCCAPGCRNNIDHPRAKPLKDFRTLQTHYKRKHGIKPFMCRKCGKAF 

AVRGDWRTHEKNCGKLWYCICGSDFKHKRSLKDHIKAFGNGHGAYGIDGFDEEDEPASEVEQ 

LDNDHESMQSK 

SEQ ID NO 43: gi_18 676370_ref_AJ311810 .2, 2xC2H2, Arabidopsis 
thaliana 

AT CT AC AC AC TACT AC T C AC ATC TC AT C T CT CT CT AG C AC AT AC CC AT C AAACC AT AT AG AT 
ACGGTGCTTTTATTCTTGATCTTCTTCTTCTTCTTTGTCTTCTCCTCAGAGTCATGTCTAAT 
CCAGCTTGTTCGAATCTCTTCAACAATGGATGTGACCATAATAGCTTCAACTATTCCACTTC 
TCTCTCTTACATTTACAACTCTCACGGTAGCTACTATTACTCTAATACCACAAACCCTAATT 
ACATTAATCATACTCATACCACTTCCACTTCCCCTAACTCACCCCCACTAAGAGAAGCTCTT 
CCTCTTCTTAGCTTAAGCCCCATAAGGCACCAAGAACAACAAGACCAACACTATTTCATGGA 
CACCCATCAAATTAGCTCTTCAAACTTTCTTGATGATCCTCTTGTGACTGTGGATCTTCATC 
TAGGGTTACCAAACTACGGTGTTGGTGAGAGCATTAGGAGCAATATTGCTCC TGAT GC AACC 
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ACGGACGAGC AAGAT C AAGAT C ATG ACCGAGGAGT AG AAGTC AC AGT T GAGT CC C ACC T TGA 
T GAT G ATG AT GATC AT C ATGG AG AT CT AC AC AGAGGT C AT CAC TAT T GGAT T CC T AC T CC TT 
CTCAGATTTTGATTGGTCCTACACAGTTCACTTGTCCTCTTTGCTTCAAGACATTCAACAGA 
TACAACAACATGCAGATGCACATGTGGGGACACGGCTCACAATACAGAAAGGGACCAGAATC 
C T T AAG AG GAAC CC AACC AAC AG GAAT GC T AAG AC T ACC ATG T TTCTGCTGT GC AC CC GG T T 
GCAAGAAC AAC AT T GACC AC CC AC G AG CC AAGC CT C T T AAGGAC T T T C G AAC CC TC C AAAC A 
CATTACAAACGTAAACATGGGTCTAAACCATTTGCTTGTCGTATGTGTGGTAAGGCCTTTGC 
AG TG AAAGG AGAT T GG AG AAC GC AT G AGAAG AAT T GT GG AAAGC T T T GGT AT TGCTCTTGTG 
GCTCGGATTTTAAGCACAAGAGGTCGCTTAAGGACCATGTCAAGGCCTTTGGAAA.TGGTCAT 
GTTCCTTGTGGGATTGATAGTTTTGGAGGAGATCATGAGGACTACTATGATGCTGCTTCTGA 
TATCGAGCAATAAGATGATAGCAACAACAATGAGTGTTAATTAGGGGTTTTGTTTATTTTTC 
CTCTCATGCATTAGTTGATTGTATGCACGTGTTCTTTAGTTTTGTTCTTCGGATCTTTGTTT 
TATTTTGTTTTGAGCTGTTTTTTTTTTAATTACTAAGAAGTTAATTATCATCTAAAGATTTT 

C 

SEQ ID NO 44: gi_1837 64 98_ref_CAC8 6167 . 1, 2xC2H2, Arabidopsis 
thaliana 

MSNPACSNLFNNGCDHNSFNYSTSLSYIYNSHGSYYYSNTTNPNYINHTHTTSTSPNSPPLR 
EALPLLSLSPIRHQEQQDQHYFMDTHQISSSNFLDDPLVTVDLHLGLPNYGVGESIRSNIAP 
DATTDEQDQDHDRGVEVTVESHLDDDDDHHGDLHRGHHYWIPTPSQILIGPTQFTCPLCFKT 
FNRYNNMQMHMWGHGSQYRKGPESLRGTQPTGMLRLPCFCCAPGCKNNIDHPRAKPLKDFRT 
LQTHYKRKHGSKPFACRMCGKAFAVKGDWRTHEKNCGKLWYC SCGS DFKHKRSLKDHVKAFG 

NGHVPC G I D S F GG DHE D Y Y DAAS D I EQ 

SEQ ID NO 45: gi_7798991 - ref_AL355775 . l_region 7957 ... 8451,, 
2xC2H2 f Arabidopsis thaliana 

ATGGTTGCGAGAAGTGAGGAAGTTGAGATAGTGGAAGATACGGCGGCGAAATGTTTGATGTT 
GTTATCAAGAGTTGGAGAATGCGGCGGAGGAGGAGAGAAACGAGTTTTCCGATGCAAGACTT 
GTCTTAAAGAGTTTTCGTCGTTTCAAGCTTTGGGAGGTCATCGTGCAAGCCACAAGAAACTC 
ATTAACAGTAGCGATCCATCACTTCTTGGATCCTTGTCTAACAAGAAAACTAAAACGGCGAC 
GTCTCATCCTTGTCCGATATGTGGCGTGGAGTTTCCGATGGGGCAAGCTCTTGGTGGTCACA 

T G AG GAGAC AT AGG AG TG AGAAAGC CT C ACC AGGC AC GT T GG TT AC AC GT T C T T T T T T ACCG 
GAGACGACGACGGTGACGACTTTGAAAAAATCGAGTAGTGGGAAGAGAGTGGCTTGTTTGGA 
CT T AGAT T CG AT GG AG AG T T T AGTC AAT T GG A&GT T GGAGT T GGGAAGAACG AT T T CT TG A 

SEQ ID NO 46: gi_7798996_ref_CAB90935 . 1, 2xC2H2, Arabidopsis 
thaliana 

MVARSEEVEIVEDTAAKCLMLLSRVGECGGGGEKRVFRCKTCLKEFSSFQALGGHRASHKKL 
INSSDPSLLGSLSNKKTKTATSHPCPICGVEFPMGQALGGHMRRHRSEKASPGTLVTRSFLP 
ET T T VT TLKKS S S GKRVACLDLD SME S LVNWKLE LGRT I S 

SEQ ID NO 47: gi_9755794_ref_AL3 91143 . 1 ^region 31730 ... 
32938, 2xC2H2, Arabidopsis thaliana 

ATGGAAGACGAACATCAAGATCTCCATAAACCCATTAATGGAGCTTTGCGAGACCTCAAGAT 
TACTCGGTCACAG2VAAGAAACAGAAAAGTCTACGAACC2VACAGCAAGATGTTACTTGTTACT 
AT GG TC T AAG GG AAAACT CG AAG AAGAAAAC CC AGGAAT C TC C GG AACC AAT GAAGAAGAT T 
TTGTTTCGATGCGAAGAATGTGGAAAAGGGTTTCGGTACGAGAAATATTTTAAGAATCATCG 
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CTCGATGATGCATTTATCGCCGAACGAG2VAGGTTTGTGAAGAATCCTTGATGACTCTGTCTC 

GT AGCC T T GG GT T T GT GAAG AAG AAGAAAAGAT C AAG AC T T G GT AGGT C T GGGAAG AC T T T A 
TTTACTACGTTTCTTGAACCGAGTTCTATTTTTGATGCGACTGATGAAGAATTAGAAGTGGC 
GGATTGTTTGATTCTATTGTCTAAGAGTGCTCCCAAGGTTGTAGACGAATTGAAAAGTCTTT 
CTGAGGCAGTACGTGTTACTCCTGAAACACCTGAAAGTAGCTATGATTTGGGTTGTTTGCTC 
AACAAGAAACCGAGAAAAGGTGGTGAATTGGAATCTGGGGTTTTAAGTAATGAGCAAAGACT 
T ATGGAAG AAGG GT TT AG TAG T TAT GG AAC AT C GAAAGAACC AGCT AGCT T C T T GAGAGACG 
AAAAC AGAT T GG AT C AGC AG AAACGG AGAAAAGAT GG T G AAT T T GAAT CC GG AC T T T T G AGT 
AATG AGC AAAGAC T GC T AG AAG AAGAGAT T ACT AC TC C T G TG AC AT T C AAAG GT CC AGCG AG 
TTCCTTGAGACACAAGTGTGCTTTGGATCGAAATGGAGGTGAATTTGGTCCTGAGTTTTTGA 
GT AATG AGC AAAC ACT GATGGAAGAAACATGGAAAGAAC C AG T GAG T T TC T T AG AAGAT AAG 
CATGAAT TTGATCAGCGGAAAATGCGAGAAGCT GGCGAC TT TGAATC TAGGT TT TACAGAAT 
T G AGC T TGGAGT AGGAGC TAT GG AG TG TAC T T C T T C AGAT AC T GAT AT GC T C AC GC AATC T G 
AT AAG AAG AACG T T GAGC AT CGATGC AGG T T GT GC AAC AAGAT AT TCTCGTCT TAT C AAGC T 
CT AGGGGG T CAT C AGACG T T T CAT C GG AT GAGC AAAT GT AAGAACAAGAAGAAT GGC AT AG A 
GGAATCAGTTGAACCCAGGATGACTCTGTGA 

SEQ ID NO 48: gi_9755803_ref_CAC01747 . 1, 2xC2H2, Arabidopsis 
thaliana 

MEDEHQDLHKPINGALRDLKITRSQKETEKSTNQQQDVTCYYGLRENSKKKTQESPEPMKKI 
LFRCEECGKGFRYEKYFKNHRSMMHLSPNEKVCEESLMTLSRSLGFVKKKKRSRLGRSGKTL 
FTTFLEPSSIFDATDEELEVADCLILLSKSAPKWDELKSLSEAVRVTPETPESSYDLGCLL 
NKKPRKGGELESGVLSNEQRLMEEGFSSYGTSKEPASFLRDENRLDQQKRRKDGEFESGLLS 
NEQRLLEEEITTPVTFKGPASSLRHKCALDRNGGEFGPEFLSNEQTIJyLEETWKEPVSFLEDK 
HEFDQRKMRE AGDFESRF YRIELGVGAMECT S S DT DMLTQSDKKNVEHRCRLCNKI FS S YQA 
LGGHQTFHRMSKCKNKKNGIEESVEPRMTL 

SEQ ID NO 49: giJL418338_ref_X98678 . 1, 2xC2H2, Arabidopsis 
thaliana 

CTTGTTAGTTCACTCCACATAATAAACACCAAAGATTTCATTCTCTTCTCCATAATTTCGAA 
GTTTCTTGAATTGGGTTTGTTTCTTGATTTGTTTCTTGAATTGGGTTTTGGTCTTCTTTTCT 
TACTATATTTGGATATGATGATGGGTCAAGATGAGGTTGGGAGTGATCAGACGCAAATCATA 
AAAGGGAAACGTACGAAGCGACAAAGATCGTCTTCGACGTTTGTGGTGACGGCGGCGACAAC 
AGTGACTTCAACAAGTTCATCGGCCGGTGGAAGTGGAGGAGAAAGAGCTGTTTCAGATGAAT 
ACAACTCGGCGGTTTCGTCTCCGGTGACTACTGATTGTACGCAAGAAGAAGAAGACATGGCG 
ATTTGTCTCATCATGTTAGCTCGTGGGACAGTTCTTCCATCGCCGGATCTCAAGAACTCGAG 
AAAAATTCATCAGAAGATTTCGTCGGAGAATTCTAGTTTCTATGTGTACGAGTGTAAAACGT 
GTAACCGGACGTTTTCGTCGTTCCAAGCACTTGGTGGACACAGAGCGAGCCACAAGAAGCCG 
AGGACGTCGACTGAGGAAAAGACTAGACTACCCCTGACGCAACCCAAGTCTAGTGCATCAGA 
AGAAGGGCAAAACAGTCATTTCAAAGTTTCCGGCTCAGCCCTAGCTTCACAGGCAAGTAACA 
TCATCAACAAGGCAAACAAAGTACACGAGTGTTCCATCTGCGGTTCTGAGTTCACTTCCGGG 
CAAGCTCTCGGTGGTCACATGAGGCGGCACAGGACAGCCGTAACCACGATTAGCCCCGTTGC 
AGCC AC CGC AGAAGTAAGCAGAAAC AG T AC AGAGG AAGAGAT TG AG AT C AAT AT AGGC CGTT 
CGAT GGAACAGCAGAGGAAAT ATCT ACCGTT GGATCT TAATC TACC AGCACC AGGAGATGAT 
CT AAG AGAGT CC AAGT T T CAAGG GAT AGT AT T C T C AGC AAC ACC AG CG T T AAT AGATT GT C A 
T TACT AGT TGTTTTTTT TAC T ACAT AAT ATGAT GAAAT ATT TG TG AAT T C T T CT TAC T TACT 
ACTATATTGTTGATCAAAAAAAAAAAAAAAAAA 
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SEQ ID NO 50: gi_1418339_ref_CAA67236 . 1, 2xC2H2, Arabidopsis 
thaliana 

MG QDEVGS DQTQ 1 1 KGKRTKRQRS S S T FWTAAT T VT S T S S S AGGS GGERAVS DE YNS AVS S 
PVTTDCTQEEEDMAICLIMLARGTVLPSPDLKNSRKIHQKISSENSSFYVYECKTCNRTFSS 
FQALGGHRASHKKPRTSTEEKTRLPLTQPKSSASEEGQNSHFKVSGSALASQASNIINKANK 
VHEC S I CG SE FT SGQALGGHMRRHRTAVT T I S PVAAT AE VSRNS TEEE IE I N I GRSMEQQRK 
YLPLDLNLPAPGDDLRESKFQGIVFSATPALIDCHY 
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SEQUENCE LISTING 

<110> CropDesign N.V. 

<120> Plants having modified growth characteristics and a method for 
making the same 

<130> CD-070-PCT 

<160> 50 

<170> Patentln version 3.1 

<210> 1 
<211> 692 
<212> DNA 

<213> Arabidopsis thaliana 
<400> 1 

aatggcgctc gaggctctta catcaccaag attagcttct ccgattcctc ctttgttcga 60 

agattcttca gtcttccatg gagtcgagca ctggacaaag ggtaagcgat ctaagagatc 120 

aagatccgat ttccaccacc aaaacctcac tgaggaagag tatctagctt tttgcctcat 180 

gcttctcgct cgcgacaacc gtcagcctcc tcctcctccg gcggtggaga agttgagcta 240 

caagtgtagc gtctgcgaca agacgttctc ttcttaccaa gctctcggtg gtcacaaggc 300 

aagccaccgt aagaacttat cacagactct ctccggcgga ggagatgatc attcaacctc 360 

gtcggcgaca accacatccg ccgtgactac tggaagtggg aaatcacacg tttgcaccat 420 

ctgtaacaag tcttttcctt ccggtcaagc tctcggcgga cacaagcggt gccactacga 480 

aggaaacaac aacatcaaca ctagtagcgt gtccaactcc gaaggtgcgg ggtccactag 540 

ccacgttagc agtagccacc gtgggtttga cctcaacatc cctccgatcc ctgaattctc 600 

gatggtcaac ggagacgacg aagtcatgag ccctatgccg gcgaagaagc ctcggtttga 660 

ctttccggtc aaacttcaac tttaaggaaa tt 692 

<210> 2 
<211> 227 
<212> PRT 

<213> Arabidopsis thaliana 
<400> 2 

Met Ala Leu Glu Ala Leu Thr Ser Pro Arg Leu Ala Ser Pro lie Pro 
15 10 15 

Pro Leu Phe Glu Asp Ser Ser Val Phe His Gly Val Glu His Trp Thr 
20 25 30 

Lys Gly Lys Arg Ser Lys Arg Ser Arg Ser Asp Phe His His Gin Asn 
35 40 45 

Leu Thr Glu Glu Glu Tyr Leu Ala Phe Cys Leu Met Leu Leu Ala Arg 
50 55 60 

Asp Asn Arg Gin Pro Pro Pro Pro Pro Ala Val Glu Lys Leu Ser Tyr 
65 = 70 75 80 

Lys Cys Ser Val Cys Asp Lys Thr Phe Ser Ser Tyr Gin Ala Leu Gly 
85 90 95 



Gly His Lys Ala Ser His Arg Lys Asn Leu Ser Gin Thr Leu Ser Gly 
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100 

Gly Gly Asp Asp His Ser Thr Ser 
115 120 

Thr Thr Gly Ser Gly Lys Ser His 
130 135 

Phe Pro Ser Gly Gin Ala Leu Gly 
145 150 

Gly Asn Asn Asn lie Asn Thr Ser 
165 

Gly Ser Thr Ser His Val Ser Ser 
180 

lie Pro Pro lie Pro Glu Phe Ser 
195 200 

Met Ser Pro Met Pro Ala Lys Lys 
210 215 



105 110 

Ser Ala Thr Thr Thr Ser Ala Val 
125 

Val Cys Thr He Cys Asn Lys Ser 
140 

Gly His Lys Arg Cys His Tyr Glu 
155 160 

Ser Val Ser Asn Ser Glu Gly Ala 
170 175 

Ser His Arg Gly Phe Asp Leu Asn 
185 190 

Met Val Asn Gly Asp Asp Glu Val 
205 

Pro Arg Phe Asp Phe Pro Val Lys 
220 



Leu Gin Leu 

225 

<210> 3 
<211> 50 
<212> DNA 

<213> Artificial sequence 



<220> 

<223> primer PRM3204 



<4 00> 3 

ggggacaagt ttgtacaaaa aagcaggctt cacaatggcg ctcgaggctc 

<210> 4 

<211> 53 

<212> DNA 

<213> Artificial sequence 



<220> 

<223> primer PRM3205 



<400> 4 

ggggaccact ttgtacaaga aagctgggta atttccttaa agttgaagtt tga 

<210> 5 

<211> 6 

<212> PRT 

<213> Artificial sequence 



<220> 

<223> QALGGH motif 
<400> 5 

Gin Ala Leu Gly Gly His 
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1 5 

<210> 6 

<211> 6 

<212> PRT 

<213> Artificial sequence 
<220> 

<223> NNM box 
<220> 

<221> MIS COFEATURE 

<222> (3).. (3) 

<223> Xaa can be either methionine or tryptophan 

<400> 6 

Asn Asn Xaa Gin Met His 
1 5 

<210> 7 

<211> 7 

<212> PRT 

<213> Artificial sequence 
<220> 

<223> EAR motif 
<220> 

<221> MISC_FEATURE 

<222> (1)..(1> 

<223> Xaa can be any hydrophobic amino acid (Ala, Cys, Phe, Gly, His, I 
le, Lys, Leu, Met, Arg, Thr, Val, Trp or Tyr) 

<220> 

<221> MISC_FEATURE 

<222> (5).. (5) 

<223> Xaa can be any hydrophobic amino acid (Ala, Cys, Phe, Gly, His, I 
le, Lys, Leu, Met, Arg, Thr, Val, Trp or Tyr) 

<220> 

<221> MI S COFEATURE 
<222> (6).. (6) 

<223> Xaa can be any amino acid or no amino acid 

<400> 7 

Xaa Asp Leu Asn Xaa Xaa Pro 
1 5 



<210> 


8 


<211> 


7 


<212> 


PRT 


<213> 


Artificial sequence 


<220> 




<223> 


B-Box 


<220> 




<221> 


MISC FEATURE 
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<222> (3).. (3) 

<223> Ser can be serine or no amino acid 
<220> 

<221> MIS COFEATURE 

<222> (6).. (6) 

<223> Xaa can be any amino acid 

<400> 8 

Lys Arg Ser Lys Arg Xaa Arg 
1 5 

<210> 9 

<211> 12 

<212> PRT 

<213> Artificial sequence 
<220> 

<223> L-Box 
<220> 

<2 2 1> MI SCJFE ATURE 

<222> (2) . . (2) 

<223> Xaa can be any amino acid 
<220> 

<221> MIS C_FE ATURE 

<222> (4).. (5) 

<223> Xaa can be any amino acid 
<220> 

<221> MI S COFEATURE 

<222> (7).. (7) 

<223> Xaa can be any amino acid 
<220> 

<221> MISC_FEATURE 

<222> (10).. (11) 

<223> Xaa can be any amino acid 

<400> 9 

Glu Xaa Glu Xaa Xaa Ala Xaa Cys Leu Xaa Xaa Leu 
15 10 

<210> 10 
<211> 1006 
<212> DNA 

<213> Datisca glomerata 
<400> 10 

ggcacgagga caaattctct ctctatcctc tgaatatctt tggtttgtga actgagaagc 60 
tattagatgg ctctagaagc gctcaactct ccgaccacag ctacgccggt gtttcactac 120 
gacgacccca gcttgaatta ccttgagcca tggaccaagc gtaagcgttc caagcgtacg 180 
cgcttagata gccccatacc gaggaagagt accttgcttt ctgcctcatc atgctcgctc 240 
gtggccgcgt tgcctctgca aatcgacggg attctcagtc ttccattcag attcagcctg 300 
aagcaacgac ttcggctacc aaagtcagtt ataagtgctc tgtgtgcgat aaggcctttt 360 
cgtcttatca ggctttgggt gggcacaagg ccagccacag aaagctcgct ggcggcgaag 420 
atcaatcgac ttcctttgcc accacgaatt cagccaccgt cactaccacc acagcctccg 480 
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gaggtggtgg 
ccttgggtgg 
accacaacaa 
gttccacaca 
cggagtttcg 
ccaagaaacc 
agtttcttta 
attaatatat 
gctttgctca 



caggtctcat gagtgttcta tttgccacaa atcgttcccg actggccagg 
tcacaagcgc tgccactacg aaggcagtat cggcggcaat agtattcacc 
taccaccaac agcggaagca acggtggcat gagcatgacc tccgaagtag 
cacagtcagc cacagtcacc gtgacttcga tctcaacatc ccggccttgc 
gtcgaatttc ttcatatccg gggatgacga ggtcgagagt cctcatccgg 
ccgtatattg atgaaataaa acatttctca agatcactga accaggcttt 
taggaggaga tttaaaaaag tagtatctct ctttctttat ccgtaggata 
ttcgtgtaca taaatttgta gttctttaac acactctgtt tcattttctt 
actttgtatt ggttatttca ttatgaaaat tcaatt 



540 
600 
660 
720 
780 
840 
900 
960 
1006 



<210> 11 
<211> 247 
<212> PRT 

<213> Datisca glomerata 
<400> 11 

Met Ala Leu Glu Ala Leu Asn Ser Pro Thr Thr Ala Thr Pro Val Phe 
15 10 15 

His Tyr Asp Asp Pro Ser Leu Asn Tyr Leu Glu Pro Trp Thr Lys Arg 
20 25 30 

Lys Arg Ser Lys Arg Thr Arg Leu Asp Ser Pro His Thr Glu Glu Glu 
35 40 45 

Tyr Leu Ala Phe Cys Leu He Met Leu Ala Arg Gly Arg Val Ala Ser 
50 55 60 

Ala Asn Arg Arg Asp Ser Gin Ser Ser He Gin He Gin Pro Glu Ala 
65 " 70 75 80 

Thr Thr Ser Ala Thr Lys Val Ser Tyr Lys Cys Ser Val Cys Asp Lys 
85 90 95 

Ala Phe Ser Ser Tyr Gin Ala Leu Gly Gly His Lys Ala Ser His Arg 
100 105 HO 

Lys Leu Ala Gly Gly Glu Asp Gin Ser Thr Ser Phe Ala Thr Thr Asn 
115 120 125 

Ser Ala Thr Val Thr Thr Thr Thr Ala Ser Gly Gly Gly Gly Arg Ser 
130 135 140 

His Glu Cys Ser He Cys His Lys Ser Phe Pro Thr Gly Gin Ala Leu 
145 150 155 160 

Gly Gly His Lys Arg Cys His Tyr Glu Gly Ser He Gly Gly Asn Ser 
165 170 175 

He His His His Asn Asn Thr Thr Asn Ser Gly Ser Asn Gly Gly Met 
180 185 190 

Ser Met Thr Ser Glu Val Gly Ser Thr His Thr Val Ser His Ser His 
195 200 205 

Arg Asp Phe Asp Leu Asn He Pro Ala Leu Pro Glu Phe Arg Ser Asn 
210 215 220 
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Phe Phe He Ser Gly Asp Asp Glu Val Glu Ser Pro His Pro Ala Lys 
225 230 '235 240 

Lys Pro Arg He Leu Met Lys 
245 

<210> 12 

<211> 996 

<212> DNA 

<213> Glycine max 



<400> 12 

aaaattctca 

ccaaacacat 

ttgacgaccc 

cttctgaaga 

ccgtcaacaa 

atccttccac 

aagcgctcgg 

cccccagcac 

atgagtgctc 

gttgtcacta 

tcgcctcgga 

atctcaacat 

gccctcaccc 

aatttcaatg 

tgagagttac 

gacccagttc 

aatttatttg 



ctctctctct 
ggctttggaa 
aactattcca 
agagtacctc 
ccgccacgtc 
caagctcagt 
tggacacaag 
caccacttcc 
catttgccac 
cgaaggtaac 
aggcgtgggc 
cccggccttt 
tgtcatgaag 
aactcgttga 
tataatttga 
tctggttgag 
tttattttaa 



catctcgaga 
gctctcaact 
tgggcgaaac 
gccctctgcc 
agccctccgc 
tacaaatgct 
gccagtcacc 
tccgccgccg 
aaatccttcc 
ggtaacggaa 
tccacccaca 
ccggattttt 
aagcctcgcc 
atttttagtt 
ttttgtacat 
gttgtacttt 
aaaaaaaaaa 



tcatagtatc 
caccaacaac 
gaaaacgttc 
tcatcatgct 
cgctacagcc 
ccgtttgcga 
ggaaactcgc 
ccaccagctc 
ccaccggaca 
ataacaacaa 
ctgtcagtca 
cgaccaaggt 
tcttcgtcat 
tatttttcga 
agtacttgga 
cacaacagtg 
aaaaaa 



atattcaata 
aaccgctcca 
aaagcgttct 
cgctcgcggc 
acagccacag 
caagagcttc 
cggcgccgcc 
cgcctccgga 
ggcccttggc 
cagtaacagc 
cggccaccac 
cggagaagac 
tcccaagatc 
ctatatattt 
agttttgttg 
gcagatttgc 



tcatttcata 
tcttttccct 
cgcgaccatc 
ggcaccacca 
ccgacaccag 
ccctcttacc 
gaagaccaac 
ggtaaggccc 
ggacacaaac 
gttgtcaccg 
cgcgacttcg 
gaggttgaga 
gaaatccccc 
tggagaattt 
gaccgtaccg 
aattcaattc 



60 
120 
180 
240 
300 
360 
420 
480 
540 
600 
660 
720 
780 
840 
900 
960 
996 



<210> 13 

<211> 240 

<212> PRT 

<213> Glycine max 

<400> 13 

Met Ala Leu Glu Ala Leu Asn Ser Pro Thr Thr Thr Ala Pro Ser Phe 
15 10 15 

Pro Phe Asp Asp Pro Thr He Pro Trp Ala Lys Arg Lys Arg Ser Lys 
20 25 30 

Arg Ser Arg Asp His Pro Ser Glu Glu Glu Tyr Leu Ala Leu Cys Leu 
35 40 45 

He Met Leu Ala Arg Gly Gly Thr Thr Thr Val Asn Asn Arg His Val 
50 55 60 



Ser Pro Pro Pro Leu Gin Pro Gin Pro Gin Pro Thr Pro Asp Pro Ser 
65 70 75 80 

Thr Lys Leu Ser Tyr Lys Cys Ser Val Cys Asp Lys Ser Phe Pro Ser 
85 90 95 

Tyr Gin Ala Leu Gly Gly His Lys Ala Ser His Arg Lys Leu Ala Gly 
100 105 HO 



Ala Ala Glu Asp Gin Pro Pro Ser Thr Thr Thr Ser Ser Ala Ala Ala 
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115 120 125 

Thr Ser Ser Ala Ser Gly Gly Lys Ala His Glu Cys Ser He Cys His 
130 135 140 

Lys Ser Phe Pro Thr Gly Gin Ala Leu Gly Gly His Lys Arg Cys His 
145 150 155 160 

Tyr Glu Gly Asn Gly Asn Gly Asn Asn Asn Asn Ser Asn Ser Val Val 
165 170 175 

Thr Val Ala Ser Glu Gly Val Gly Ser Thr His Thr Val Ser His Gly 
180 185 190 

His His Arg Asp Phe Asp Leu Asn He Pro Ala Phe Pro Asp Phe Ser 

195 ' 200 205 

Thr Lys Val Gly Glu Asp Glu Val Glu Ser Pro His Pro Val Met Lys 
210 215 220 

Lys Pro Arg Leu Phe Val lie Pro Lys He Glu lie Pro Gin Phe Gin 
225 230 235 240 

<210> 14 
<211> 1006 
<212> DNA 

<213> Medicago sativa 
<400> 14 

aattcggcac gagaaataac cacttctctc tcaaaacctc cttttgcctt ttgcttctac 60 

tttcacttgc gtaacgctaa ctaactcttc tcgagtgttc ttcttttcat catatggcta 120 

tggaagcact taactcaccc accactgcta ctcctttcac accctttgag gaaccaaatc 180 

tgagttatct tgaaacaccg tggacgaaag gtaaacgatc aaagcgttct cgcatggatc 240 

aatcttcatg cactgaagaa gagtatctcg ctctttgtct catcatgctt gctcgcagcg 300 

gtaacaacaa cgacaaaaag tctgattcgg tggcgacgcc gctaaccacc gttaaactca 360 

gtcacaaatg ctcagtctgc aacaaagctt tctcatctta tcaagcccta ggtggacaca 420 

aagccagtca ccggaaagct gttatgtccg caaccaccgc tgaagatcag atcaccacca 480 

cttcatccgc cgtgactacc agctctgctt ccaacggtaa gaacaagact catgagtgtt 540 

ccatctgtca caaatccttc cctactggac aggctttggg aggacacaag cgttgtcact 600 

acgaaggcag cgttggtgcc ggtgccggtg ctggaagtaa cgctgtaact gcctctgaag 660 

gagttggatt gtcacacagc caccaccgtg attttgatct taacctcccg gcttttccgg 720 

acttttcaaa gaagtttttc gtggatgacg aggtttttag tcctttacct gctgcaaaga 7 80 

agccctgtct tttcaagctg gaaattcctt ctcattactg atcaataata gatccaattt 84 0 

tattgttatt attattaata attattatcg cttagggcat agttattttc ttttttcttt 900 

caattatttc ggatcaattt gttctgtaca tacaaattgg gattggtttt agaatttagg 960 

acggttgtag acaatggaaa ttcaattcaa ttatttaatt ttgtgt 1006 

<210> 15 
<211> 235 
<212> PRT 

<213> Medicago sativa 
<400> 15 

Met Ala Met Glu Ala Leu Asn Ser Pro Thr Thr Ala Thr Pro Phe Thr 
15 10 15 

Pro Phe Glu Glu Pro Asn Leu Ser Tyr Leu Glu Thr Pro Trp Thr Lys 
20 25 30 
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Gly Lys Arg Ser Lys Arg Ser Arg Met Asp Gin Ser Ser Cys Thr Glu 
35 40 45 

Glu Glu Tyr Leu Ala Leu Cys Leu lie Met Leu Ala Arg Ser Gly Asn 
50 55 60 

Asn Asn Asp Lys Lys Ser Asp Ser Val Ala Thr Pro Leu Thr Thr Val 
65 70 75 80 

Lys Leu Ser His Lys Cys Ser Val Cys Asn Lys Ala Phe Ser Ser Tyr 
85 90 95 

Gin Ala Leu Gly Gly His Lys Ala Ser His Arg Lys Ala Val Met Ser 
100 105 110 

Ala Thr Thr Ala Glu Asp Gin lie Thr Thr Thr Ser Ser Ala Val Thr 
115 120 125 

Thr Ser Ser Ala Ser Asn Gly Lys Asn Lys Thr His Glu Cys Ser lie 
130 135 140 

Cys His Lys Ser Phe Pro Thr Gly Gin Ala Leu Gly Gly His Lys Arg 
145 150 155 160 

Cys His Tyr Glu Gly Ser Val Gly Ala Gly Ala Gly Ala Gly Ser Asn 
165 170 175 

Ala Val Thr Ala Ser Glu Gly Val Gly Leu Ser His Ser His His Arg 
180 185 190 

Asp Phe Asp Leu Asn Leu Pro Ala Phe Pro Asp Phe Ser Lys Lys Phe 
195 200 205 

Phe Val Asp Asp Glu Val Phe Ser Pro Leu Pro Ala Ala Lys Lys Pro 
210 ' 215 220 

Cys Leu Phe Lys Leu Glu lie Pro Ser His Tyr 
225 230 235 

<210> 16 
<211> 1061 
<212> DNA 

<213> Nicotiana tabacum 
<400> 16 

ttttccctcg aatttgataa ctaaagagaa tattatgact cttgaagctt tgaagtcacc 
tacggcggca acgccgactc taccaccacg ctatgaagat gatgatgaaa ttcataattt 120 
ggattcttgg gctaaaggaa aacgatcaaa acggccccgt attgatgccc caccgactga 180 
agaagagtat ttagccctct gtctcatcat gctcgctcgc agcggaaccg gaaccagaac 24 0 
cggtttaact gatgctacta cttcccaaca acctgccgat aaaaaaaccg ccgagttgcc 300 
gccggttcat aagaaagagg tggcaacaga gcaagcagag caatcttaca agtgtagcgt 360 
gtgtgacaag gctttttctt cttatcaagc actcggtggg cataaagcaa gtcaccgtaa 420 
aactactact actgctaccg ccgcctctga tgataacaat ccttcaactt caacttccac 480 
tggcgccgtt aatatctctg ctcttaatcc aactggtcgt tcacacgtct gttctatttg 54 0 
ccacaaggct tttcctactg gccaagcttt gggtgggcac aagcgccgcc actatgaagg 600 
caaactcggt ggtaacagcc gcgacttagg cggcggcggc ggcggcggtc atagtggaag 660 
cgtcttgact acttcagacg gcggcgcgtc gactcacacg ctacgtgact ttgacctgaa 720 



60 
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catgcctgct tcgccggaat tgcaactggg tctgagtatt gattgtggac ggaaaagtca 780 

actgttgccg atggtccaag aggtggaaag tcctatgcct gcaaagaaac cgcgtttatt 840 

gttttcgttg ggttgaaact tctttagggg aattgaattg attgtgtttt agccaaatta 900 

gtaaattggt tcatgtgatt ttatttttag gaaaaggaat tattgattgt tttacccgtt 960 

tattcttagg gtggtattat gtacagggag tgaatcattc attggtttta cactttctta 1020 

attatatatt cttttttttt acacataaaa aaaaaaaaaa a 1061 

<210> 17 
<211> 273 
<212> PRT 

<213> Nicotiana tabacum 
<400> 17 

Met Thr Leu Glu Ala Leu Lys Ser Pro Thr Ala Ala Thr Pro Thr Leu 
15 10 15 

Pro Pro Arg Tyr Glu Asp Asp Asp Glu lie His Asn Leu Asp Ser Trp 
20 25 30 

Ala Lys Gly Lys Arg Ser Lys Arg Pro Arg lie Asp Ala Pro Pro Thr 
35 40 45 

Glu Glu Glu Tyr Leu Ala Leu Cys Leu He Met Leu Ala Arg Ser Gly 
50 55 60 

Thr Gly Thr Arg Thr Gly Leu Thr Asp Ala Thr Thr Ser Gin Gin Pro 
65 70 75 80 

Ala Asp Lys Lys Thr Ala Glu Leu Pro Pro Val His Lys Lys Glu Val 
85 90 95 

Ala Thr Glu Gin Ala Glu Gin Ser Tyr Lys Cys Ser Val Cys Asp Lys 
100 105 HO 

Ala Phe Ser Ser Tyr Gin Ala Leu Gly Gly His Lys Ala Ser His Arg 
115 120 125 

Lys Thr Thr Thr Thr Ala Thr Ala Ala Ser Asp Asp Asn Asn Pro Ser 
130 135 140 

Thr Ser Thr Ser Thr Gly Ala Val Asn lie Ser Ala Leu Asn Pro Thr 
145 150 155 160 

Gly Arg Ser His Val Cys Ser He Cys His Lys Ala Phe Pro Thr Gly 
165 170 175 

Gin Ala Leu Gly Gly His Lys Arg Arg His Tyr Glu Gly Lys Leu Gly 
180 185 190 

Gly Asn Ser Arg Asp Leu Gly Gly Gly Gly Gly Gly Gly His Ser Gly 
195 200 205 

Ser Val Leu Thr Thr Ser Asp Gly Gly Ala Ser Thr His Thr Leu Arg 
210 215 220 

Asp Phe Asp Leu Asn Met Pro Ala Ser Pro Glu Leu Gin Leu Gly Leu 
225 230 235 240 
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Ser lie Asp Cys Gly Arg Lys Ser Gin Leu Leu Pro Met Val Gin Glu 
245 250 255 

Val Glu Ser Pro Met Pro Ala Lys Lys Pro Arg Leu Leu Phe Ser Leu 
260 265 270 

Gly 



<210> 18 

<211> 1213 

<212> DNA 

<213> Oryza sativa 



<400> 18 

aattcggcac 

cgaagcttag 

gtggaagcat 

agcagcagca 

gggaggaggg 

gatcggagga 

gcgtccaggc 

agtgctccgt 

gccaccgggt 

ccgccttgct 

ccgacggcat 

ggcaggcgct 

gcgcatcttc 

gcaacggcca 

tgtggcggcc 

ccctcgcctt 

tccgtcagag 

ttttgtgccg 

agccatacat 

gaactgtacc 

aaaaaaaaaa 



gaggccacac 
ctagcggata 
gtcgagcgcg 
gcacgaggtg 
cggacacctg 
ggagaacctc 
gccgcctccg 
ctgcggcaag 
caagctgccg 
gccttccgcc 
gaccaacaga 
cggcgggcac 
aaccgagctc 
gtccgccacc 
gtgctccaag 
caagaagccc 
tttttgtcta 
ctattgttta 
acagtcatac 
tgtatctttt 
aaa 



agcaaccagc 
gaacaagtcg 
tcgtccatgg 
gaggaggcga 
ccccaggggt 
gcgctctgcc 
ctctcggctt 
tccttcagct 
actccgcccg 
gaggaccgcg 
gtccacaggt 
aagaggaagc 
ctggccacgg 
cgggcgttcg 
ggcaagaaga 
cggcttctca 
gggagtgaaa 
atttgttcct 
agatataggt 
acactttgtt 



cagctgccac 
tcgatctgct 
aagcgctcca 
cggtcgtgac 
gggcgaagcg 
tcctcatgct 
cggcgccccc 
cctaccaggc 
cagctcccgt 
agccagccac 
gttccatctg 
actacgacgg 
tggccgccga 
acctcaacct 
tgtgggacga 
ccgcgtaatt 
ttcagtcgaa 
gcttttgtac 
ctagctcttc 
ctttgacagt 



actagcttga 
tgctgctttt 
cgccgcggtg 
gagcagcagc 
gaagcggtcg 
cgcccgcggc 
gccggcaggt 
gctcggcggc 
cttggctccc 
gtcatccacc 
ccagaaggag 
tggcgtaggc 
gtccgaggtg 
cccggccgtg 
ggaggaggag 
cagcagctgc 
acacactatt 
agagcaagcg 
cttggttctt 
catatattgt 



ggcgagcgag 
gtgaattgcg 
ctcaaggagg 
gccacgagcg 
cgccgccagc 
ggccaccacc 
gcggagttca 
cacaagacga 
gcccccgtcg 
gccgcgtcct 
ttccccaccg 
gccggcgccg 
ggaagctccg 
ccggagttcg 
gtccagagcc 
acggatccga 
cgttgattcg 
agtgatacat 
tgtaacactg 
agaccaaaaa 



60 
120 
180 
240 
300 
360 
420 
480 
540 
600 
660 
720 
780 
840 
900 
960 
1020 
1080 
1140 
1200 
1213 



<210> 19 

<211> 269 

<212> PRT 

<213> Oryza sativa 

<400> 19 

Met Ser Ser Ala Ser Ser Met Glu Ala Leu His Ala Ala Val Leu Lys 
15 10 15 

Glu Glu Gin Gin Gin His Glu Val Glu Glu Ala Thr Val Val Thr Ser 
20 25 30 

Ser Ser Ala Thr Ser Gly Glu Glu Gly Gly His Leu Pro Gin Gly Trp 
35 40 45 



Ala Lys Arg Lys Arg Ser Arg Arg Gin Arg Ser Glu Glu Glu Asn Leu 
50 55 60 

Ala Leu Cys Leu Leu Met Leu Ala Arg Gly Gly His His Arg Val Gin 
65 70 75 80 
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Ala Pro Pro Pro Leu Ser Ala Ser Ala Pro Pro Pro Ala Gly Ala Glu 
85 90 95 

Phe Lys Cys Ser Val Cys Gly Lys Ser Phe Ser Ser Tyr Gin Ala Leu 
100 105 110 

Gly Gly His Lys Thr Ser His Arg Val Lys Leu Pro Thr Pro Pro Ala 
115 120 125 

Ala Pro Val Leu Ala Pro Ala Pro Val Ala Ala Leu Leu Pro Ser Ala 
130 135 140 

Glu Asp Arg Glu Pro Ala Thr Ser Ser Thr Ala Ala Ser Ser Asp Gly 
145 " 150 155 160 

Met Thr Asn Arg Val His Arg Cys Ser lie Cys Gin Lys Glu Phe Pro 
165 170 175 

Thr Gly Gin Ala Leu Gly Gly His Lys Arg Lys His Tyr Asp Gly Gly 
180 185 190 

Val Gly Ala Gly Ala Gly Ala Ser Ser Thr Glu Leu Leu Ala Thr Val 



Arg Ala Phe Asp Leu Asn Leu Pro Ala Val Pro Glu Phe Val Trp Arg 
225 230 235 240. 

Pro Cys Ser Lys Gly Lys Lys Met Trp Asp Glu Glu Glu Glu Val Gin 
245 250 255 

Ser Pro Leu Ala Phe Lys Lys Pro Arg Leu Leu Thr Ala 
260 265 

<210> 20 
<211> 1020 
<212> DNA 

<213> Petunia x hybrida 
<400> 20 

ttcactcacc aaaacaactt ctctacctct tctacttgca cattcaaatt ctttcattac 60 

tacttatctc tactaatctt gattcgattt tagtaaatca aacaagagaa tcttttcagt 120 

aatacaaaca agaaaatttt ctctctatac ttgattgagt ttagtaaggc aaacaagaaa 180 

actatcatgg cacttgaagc attgaattct ccaactacaa caacaccacc atcattccaa 240 

tttgagaaca acgggcttaa gtaccttgag agttggacaa aaggtaaaag atcaaaaagg 300 

caacgcagca tggaacgaca gtgtactgaa gaagagtatt tagcactttg tcttatcatg 360 

ctagcacgta gcgatggttc tgttaataac tcacggtctc taccaccacc accactacca 420 

ccatcagttc cagtaacgtc gcaaataaac gcgacgttat tggaacagaa gaatttgtac 480 

aagtgttccg tttgtggtaa agggtttggg tcttatcaag ctttaggtgg acataaagca 540 

agtcaccgga aacttgtcag catgggagga gatgaacaat ctactacttc cactactact 600 

aacgtaacgg gaactagttc cgctaacgtt aacggtaacg gaagaactca cgaatgttca 660 

atttgtcaca agtgctttcc tactggacaa gctttaggtg gtcataaaag gtgccactat 720 

gacggtggta acggtaacgg taacggaagt gtaagtgttg gggtgacgtc atctgaaggt 780 

gtggggtcca ctattagtca tcaccgtgac tttgacttga atattcccgc gttgccggag 840 

ttttggccgg gatttggttc cggcgaggat gaggtggaga gtcctcatcc agcaaagaag 900 
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tcaaggctat ctcttccacc taaacttgaa ttattcaaag gattatagag ggaatattga 960 
tttgttacag gaagatttat taggattcac gaattttttg ttgactagtt tatgtaatat 1020 

<210> 21 
<211> 253 
<212> PRT 

<213> Petunia x hybrida 
<400> 21 

Met Ala Leu Glu Ala Leu Asn Ser Pro Thr Thr Thr Thr Pro Pro Ser 
15 10 15 

Phe Gin Phe Glu Asn Asn Gly Leu Lys Tyr Leu Glu Ser Trp Thr Lys 
20 25 30 

Gly Lys Arg Ser Lys Arg Gin Arg Ser Met Glu Arg Gin Cys Thr Glu 
35 40 45 

Glu Glu Tyr Leu Ala Leu Cys Leu lie Met Leu Ala Arg Ser Asp Gly 
50 55 60 

Ser Val Asn Asn Ser Arg Ser Leu Pro Pro Pro Pro Leu Pro Pro Ser 
65 70 75 80 

Val Pro Val Thr Ser Gin lie Asn Ala Thr Leu Leu Glu Gin Lys Asn 
85 90 95 

Leu Tyr Lys Cys Ser Val Cys Gly Lys Gly Phe Gly Ser Tyr Gin Ala 
100 105 110 

Leu Gly Gly His Lys Ala Ser His Arg Lys Leu Val Ser Met Gly Gly 
115 120 125 

Asp Glu Gin Ser Thr Thr Ser Thr Thr Thr Asn Val Thr Gly Thr Ser 
130 135 140 

Ser Ala Asn Val Asn Gly Asn Gly Arg Thr His Glu Cys Ser lie Cys 
145 150 155 160 

His Lys Cys Phe Pro Thr Gly Gin Ala Leu Gly Gly His Lys Arg Cys 
165 17 0 175 

His Tyr Asp Gly Gly Asn Gly Asn Gly Asn Gly Ser Val Ser Val Gly 
180 185 190 

Val Thr Ser Ser Glu Gly Val Gly Ser Thr lie Ser His His Arg Asp 
195 200 205 

Phe Asp Leu Asn lie Pro Ala Leu Pro Glu Phe Trp Pro Gly Phe Gly 
210 215 220 

Ser Gly Glu Asp Glu Val Glu Ser Pro His Pro Ala Lys Lys Ser Arg 
225 230 235 240 

Leu Ser Leu Pro Pro Lys Leu Glu Leu Phe Lys Gly Leu 
245 250 
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60 



786 



<210> 22 
<211> 786 
<212> DNA 

<213> Triticum aestivum 
<400> 22 

atgtcgtcgt cggccatgga agcgctccac gccctgatcc cggagcagca ccagctggac 
gttgaggcgg ctgcggctgt cagcagcgcc accagcggcg aggagagcgg ccacgtgctg 120 
caggggtggg ccaagaggaa gcgatcgcgc cgccagcgct ccgaggagga gaacctcgcg 180 
ctctgcctcc tcatgctctc gcgcggcggc aagcagcgtg ttcaggcgcc gcagccggag 240 
tcgttcgctg cgccggtgcc tgccgagttc aagtgctccg tctgcggcaa gtccttcagc 300 
tcctaccagg cgctcggagg ccacaagacg agccaccggg tgaagcagcc gtctcctccc 360 
tctgatgccg ctgctgcccc actcgtggcc ctcccggccg tcgccgccat cctgccgtcc 420 
gccgagccgg ccacgtcgtc caccgccgcg tcctccgacg gcgcgaccaa cagagtccac 480 
aggtgctcca tctgccaaaa ggagttcccg actgggcagg cgctcggcgg gcacaagagg 540 
aagcactacg acggaggcgt gggcgccgcc gcctcgtcga ccgagcttct ggccgccgcg 600 
gccgccgagt ctgaggtggg gagcaccggc aacgggagct ccgccgcccg ggccttcgac 660 
ctgaacattc cggccgtgcc ggagttcgtg tggaggccgt gcgccaaggg caagatgatg 720 
tgggaggacg atgaggaggt gcagagcccc ctcgccttca agaagcctcg gcttctcacc 780 
gcttga 

<210> 23 
<211> 261 
<212> PRT 

<213> Triticum aestivum 
'<400> 23 

Met Ser Ser Ser Ala Met Glu Ala Leu His Ala Leu lie Pro Glu Gin 
15 10 15 

His Gin Leu Asp Val Glu Ala Ala Ala Ala Val Ser Ser Ala Thr Ser 
20 25 30 

Gly Glu Glu Ser Gly His Val Leu Gin Gly Trp Ala Lys Arg Lys Arg 
35 40 45 

Ser Arg Arg Gin Arg Ser Glu Glu Glu Asn Leu Ala Leu Cys Leu Leu 
50 55 60 

Met Leu Ser Arg Gly Gly Lys Gin Arg Val Gin Ala Pro Gin Pro Glu 
65 70 75 80 

Ser Phe Ala Ala Pro Val Pro Ala Glu Phe Lys Cys Ser Val Cys Gly 
85 90 95 

Lys Ser Phe Ser Ser Tyr Gin Ala Leu Gly Gly His Lys Thr Ser His 
100 105 110 

Arg Val Lys Gin Pro Ser Pro Pro Ser Asp Ala Ala Ala Ala Pro Leu 
115 120 125 

Val Ala Leu Pro Ala Val Ala Ala He Leu Pro Ser Ala Glu Pro Ala 
130 135 140 

Thr Ser Ser Thr Ala Ala Ser Ser Asp Gly Ala Thr Asn Arg Val His 
145 150 155 160 



Arg Cys Ser He Cys Gin Lys Glu Phe Pro Thr Gly Gin Ala Leu Gly 
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165 



170 



175 



Gly His Lys Arg Lys His Tyr Asp Gly Gly Val Gly Ala Ala Ala Ser 
180 185 190 

Ser Thr Glu Leu Leu Ala Ala Ala Ala Ala Glu Ser Glu Val Gly Ser 
195 200 205 

Thr Gly Asn Gly Ser Ser Ala Ala Arg Ala Phe Asp Leu Asn lie Pro 
210 215 220 

Ala Val Pro Glu Phe Val Trp Arg Pro Cys Ala Lys Gly Lys Met Met 
225 230 235 240 

Trp Glu Asp Asp Glu Glu Val Gin Ser Pro Leu Ala Phe Lys Lys Pro 
245 250 255 

Arg Leu Leu Thr Ala 
260 

<210> 24 
<211> 1026 
<212> DNA 

<213> Capsicum annum 



<400> 24 

aaaatcttcg 

aacggatatc 

accgtttcaa 

aaagagatct 

gctttgtttg 

accgccggct 

gaaggagaag 

tttaggtgga 

tacctccaca 

aagtggaagg 

aggtggacac 

cgttagtgct 

ggatttcgac 

agatgaggtg 

tgaattattt 

attatactat 

ctatagacct 

aaaaaa 



ctacttactt 
gatatggcac 
tttgagagcg 
aaaaggtcac 
atcatgcttg 
ccggtgatga 
atggtgtata 
cacaaagcta 
accactaacg 
actcacgagt 
aaaaggtgtc 
agcgttggag 
ttgaacattc 
gagagtccac 
caacattaat 
gtaagtttta 
agctagttac 



acatcttcta 
ttgaagcttt 
acggccaaca 
gcagcatgga 
cacgtagcgg 
aactgcacgc 
agtgttcggt 
gtcaccggaa 
caaccggaac 
gttcgatttg 
actacgacgg 
tgacgtcatc 
cggcgttgcc 
atccggcgaa 
gggaatttga 
attttcattg 
taatagcaaa 



gaatagtcac 
gaattctcca 
gcttcgatat 
gcaccagcct 
tggctccgtt 
gccgtcgtca 
ttgtggtaag 
actcgtaccc 
aacaacctcc 
tcacaagtgt 
cggtatcggt 
ggagggtgtg 
ggaattctgg 
gaaatcgcgg 
ttgttaggat 
tgggtgggag 
aattcaattg 



tagaaccagt 


aactttatac 


60 


actggtacac 


caactccgcc 


120 


atcgaaaact 


ggaggaaggg 


180 


actgaggaag 


aatacttagc 


240 


aatcatcaac 


gatctctacc 


300 


tcatcggcgg 


cggaggagga 


360 


ggatttgggt 


cttatcaagc 


420 


ggcggagatg 


atcagtcaac 


480 


gttaacggca 


acggcaacag 


540 


tttcccactg 


gacaagcttt 


600 


aacggaaacg 


ctaacagtgg 


660 


gggtccacag 


tcagtcaccg 


720 


ctgggatttg 


gttccggcga 


780 


ttatgtttgc 


ctccaaaata 


840 


ttactatttt 


ggtagacaaa 


900 


caaaattttt 


aattttttgt 


960 


attgatttaa 


aaaaaaaaaa 


1020 






1026 



<210> 25 
<211> 261 
<212> PRT 

<213> Capsicum annum 
<400> 25 

Met Ala Leu Glu Ala Leu Asn Ser Pro Thr Gly Thr Pro Thr Pro Pro 
15 10 15 



Pro Phe Gin Phe Glu Ser Asp Gly Gin Gin Leu Arg Tyr lie Glu Asn 
20 25 30 



Trp Arg Lys Gly Lys Arg Ser Lys Arg Ser Arg Ser Met Glu His Gin 
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35 40 45 

Pro Thr Glu Glu Glu Tyr Leu Ala Leu Cys Leu He Met Leu Ala Arg 
• 50 55 60 

Ser Gly Gly Ser Val Asn His Gin Arg Ser Leu Pro Pro Pro Ala Pro 
65 ~ 70 75 80 

Val Met Lys Leu His Ala Pro Ser Ser Ser Ser Ala Ala Glu Glu Glu 
85 90 95 

Lys Glu Lys Met Val Tyr Lys Cys Ser Val Cys Gly Lys Gly Phe Gly 
100 105 HO 

Ser Tyr Gin Ala Leu Gly Gly His Lys Ala Ser His Arg Lys Leu Val 
115 120 125 

Pro Gly Gly Asp Asp Gin Ser Thr Thr Ser Thr Thr Thr Asn Ala Thr 
130 135 140 

Gly Thr Thr Thr Ser Val Asn Gly Asn Gly Asn Arg Ser Gly Arg Thr 
145 150 155 160 

His Glu Cys Ser He Cys His Lys Cys Phe Pro Thr Gly Gin Ala Leu 
165 170 175 

Gly Gly His Lys Arg Cys His Tyr Asp Gly Gly He Gly Asn Gly Asn 
180 185 190 

Ala Asn Ser Gly Val Ser Ala Ser Val Gly Val Thr Ser Ser Glu Gly 
195 200 205 

Val Gly Ser Thr Val Ser His Arg Asp Phe Asp Leu Asn He Pro Ala 
210 215 220 

Leu Pro Glu Phe Trp Leu Gly Phe Gly Ser Gly Glu Asp Glu Val Glu 
225 230 235 240 

Ser Pro His Pro Ala Lys Lys Ser Arg Leu Cys Leu Pro Pro Lys Tyr 
245 250 255 

Glu Leu Phe Gin His 
260 

<210> 26 
<211> 1068 
<212> DNA 

<213> Arabidopsis thaliana 
<400> 26 

acttcactct ctaatttcct tctctctatc tctcaccata ttcgcgatta aaaactctca 60 
acttttctct caaatttctg atcctttgat ccaacagtta gaagaagatt catctgatca 120 
tggccctcga agcgatgaac actccaactt cttctttcac cagaatcgaa acgaaagaag 180 
atttgatgaa cgacgccgtt ttcattgagc cgtggcttaa acgcaaacgc tccaaacgtc 240 
agcgttctca cagcccttct tcgtcttctt cctcaccgcc tcgatctcga cccaaatccc 300 
agaatcaaga tcttacggaa gaagagtatc tcgctctttg tctcctcatg ctcgctaaag 360 
atcaaccgtc gcaaacgcga tttcatcaac agtcgcaatc gttaacgccg ccgccagaat 420 
caaagaacct tccgtacaag tgtaacgtct gtgaaaaagc gtttccttcc tatcaggctt 480 
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taggcggtca caaagcaagt caccgaatca aaccaccaac cgtaatctca acaaccgccg 540 

atgattcaac agctccgacc atctccatcg tcgccggaga aaaacatccg attgctgcct 600 

ccggaaagat ccacgagtgt tcaatctgtc ataaagtgtt tccgacgggt caagctttag 660 

gcggtcacaa acgttgtcac tacgaaggca acctcggcgg cggaggagga ggaggaagca 720 

aatcaatcag tcacagtgga agcgtgtcga gcacggtatc ggaagaaagg agccaccgtg 780 

gattcatcga tctaaaccta ccggcgttac ctgaactcag ccttcatcac aatccaatcg 840 

tcgacgaaga gatcttgagt ccgttgaccg gtaaaaaacc gcttttgttg accgatcacg 900 

accaagtcat caagaaagaa gatttatctt taaaaatcta atactcgact attaattctt 960 

gtgtgatttt tttcgttaca accatagttt cattttcatt tttttagtta caaattttta 1020 

attgttctga tttggattga atattggtat attgttaggg gttgatac 1068 

<210> 27 
<211> 273 
<212> PRT 

<213> Arabidopsis thaliana 
<400> 27 

Met Ala Leu Glu Ala Met Asn Thr Pro Thr Ser Ser Phe Thr Arg lie 
15 10 15 

Glu Thr Lys Glu Asp Leu Met Asn Asp Ala Val Phe lie Glu Pro Trp 
20 25 30 

Leu Lys Arg Lys Arg Ser Lys Arg Gin Arg Ser His Ser Pro Ser Ser 
35 " 40 45 

Ser Ser Ser Ser Pro Pro Arg Ser Arg Pro Lys Ser Gin Asn Gin Asp 
50 55 60 

Leu Thr Glu Glu Glu Tyr Leu Ala Leu Cys Leu Leu Met Leu Ala Lys 
65 70 75 80 

Asp Gin Pro Ser Gin Thr Arg Phe His Gin Gin Ser Gin Ser Leu Thr 
85 90 95 

Pro Pro Pro Glu Ser Lys Asn Leu Pro Tyr Lys Cys Asn Val Cys Glu 
100 105 110 

Lys Ala Phe Pro Ser Tyr Gin Ala Leu Gly Gly His Lys Ala Ser His 
115 ~ 120 125 

Arg lie Lys Pro Pro Thr Val lie Ser Thr Thr Ala Asp Asp Ser Thr 
130 " 135 140 

Ala Pro Thr lie Ser lie Val Ala Gly Glu Lys His Pro He Ala Ala 
145 150 155 160 

Ser Gly Lys He His Glu Cys Ser He Cys His Lys Val Phe Pro Thr 
165 170 175 

Gly Gin Ala Leu Gly Gly His Lys Arg Cys His Tyr Glu Gly Asn Leu 
180 185 190 

Gly Gly Gly Gly Gly Gly Gly Ser Lys Ser He Ser His Ser Gly Ser 
195 200 205 

Val Ser Ser Thr Val Ser Glu Glu Arg Ser His Arg Gly Phe He Asp 
210 215 220 
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Leu Asn Leu Pro Ala Leu Pro Glu Leu Ser Leu His His Asn Pro He 
225 230 235 240 

Val Asp Glu Glu He Leu Ser Pro Leu Thr Gly Lys Lys Pro Leu Leu 
245 250 255 

Leu Thr Asp His Asp Gin Val He Lys Lys Glu Asp Leu Ser Leu Lys 
260 265 270 

He 



<210> 
<211> 
<212> 



28 

976 

DNA 



<213> Arabidopsis thaliana 



<400> 28 

aaatcaaatc 

ttcaagataa 

ctgtttcaag 

agatctgaat 

cttcttgctc 

cctcctctgc 

ttttcgtctt 

actcaatctg 

tccggtggcg 

gccaccggtc 

ggtgtgagta 

ggccaccgtg 

gacgaagagg 

ccctaaacat 

gtatatacaa 

ttgtataatt 

ttgcttggat 



ttttcattta 
tggcacttga 
attcagcact 
tcgaccgtca 
gcgacggaga 
ttcctcctct 
accaggctct 
ccggaggaga 
ggggaggaag 
aagctctcgg 
gtagcgtgtc 
ggtttgacct 
tgatgagtcc 
aaacctagga 
atatcgattt 
gatgtttctg 
acatca 



caattatctt 
aactcttact 
agggtttcat 
gagtctcacg 
tagaaaccgt 
tcctactccg 
tggtggacac 
tgagctgtcg 
tgtgaagtcg 
cggccacaaa 
gaattctgaa 
caacataccg 
tatgccggcg 
aaaactttac 
tgattgatgt 
aaaaagatat 



tcttctcaat 
tctccaagat 
ggaagcaaag 
gaggatgaat 
gaccttgacc 
atctacaagt 
aaggcaagtc 
acatcgtcgg 
cacgtttgct 
cggtgccact 
gatgtggggt 
ccgataccgg 
aagaaactcc 
agaattcatt 
tcttcttcac 
aactttttat 



ttagaactta 
tatcttctcc 
gcaaacgatc 
atatcgcttt 
tgccttcttc 
gtagcgtctg 
accggaaaag 
cgataaccac 
ctatctgtca 
acgaaggaaa 
ctacaagcca 
aattctcgat 
ggtttgactt 
ttataggaaa 
tgaaaaatta 
tgtttcacac 



gtagctagtc 
gatgccgact 
taagcgatca 
atgtctcatg 
ttcgtcttca 
tgacaaggcg 
cttttcgctt 
gtctggtata 
taaatcgttc 
gaacggaggc 
cgtcagcagt 
ggtcaacgga 
cccggagaaa 
ttgttttact 
tgattctttg 
gtatcaaaat 



<210> 29 
<211> 238 
<212> PRT 

<213> Arabidopsis thaliana 
<400> 29 

Met Ala Leu Glu Thr Leu Thr Ser Pro Arg Leu Ser Ser Pro Met Pro 
15 10 15 

Thr Leu Phe Gin Asp Ser Ala Leu Gly Phe His Gly Ser Lys Gly Lys 
20 25 30 



60 
120 
180 
240 
300 
360 
420 
480 
540 
600 
660 
720 
780 
840 
900 
960 
976 



Arg Ser Lys Arg Ser Arg Ser Glu Phe Asp Arg Gin Ser Leu Thr Glu 
35 40 45 

Asp Glu Tyr He Ala Leu Cys Leu Met Leu Leu Ala Arg Asp Gly Asp 
50 55 60 



Arg Asn Arg Asp Leu Asp Leu Pro Ser Ser Ser Ser Ser Pro Pro Leu 
65 70 75 80 
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Leu Pro Pro Leu Pro Thr Pro lie Tyr Lys Cys Ser Val Cys Asp Lys 
85 90 95 

Ala Phe Ser Ser Tyr Gin Ala Leu Gly Gly His Lys Ala Ser His Arg 
100 105 110 

Lys Ser Phe Ser Leu Thr Gin Ser Ala Gly Gly Asp Glu Leu Ser Thr 
115 120 125 

Ser Ser Ala lie Thr Thr Ser Gly He Ser Gly Gly Gly Gly Gly Ser 
130 135 140 

Val Lys Ser His Val Cys Ser He Cys His Lys Ser Phe Ala Thr Gly 
145 150 155 160 

Gin Ala Leu Gly Gly His Lys Arg Cys His Tyr Glu Gly Lys Asn Gly 
165 170 175 

Gly Gly Val Ser Ser Ser Val Ser Asn Ser Glu Asp Val Gly Ser Thr 
180 185 190 

Ser His Val Ser Ser Gly His Arg Gly Phe Asp Leu Asn He Pro Pro 
195 200 205 

He Pro Glu Phe Ser Met Val Asn Gly Asp Glu Glu Val Met Ser Pro 
210 215 220 

Met Pro Ala Lys Lys Leu Arg Phe Asp Phe Pro Glu Lys Pro 
225 230 235 

<210> 30 
<211> 718 
<212> DNA 

<213> Arabidopsis thaliana 
<400> 30 

atggctctcg acactctcaa ttctcccacc tccaccacca caaccaccgc tcctcctcct 60 

ttcctccgtt gcctcgacga aaccgagccc gaaaacctcg aatcatggac caaaagaaaa 120 

cgtacaaaac gtcaccgtat agatcaacca aaccctcctc cttctgaaga agagtatctc 180 

gctctttgcc tccttatgct cgctcgtggc tcctccgatc atcactctcc accgtcggat 24 0 

catcactctc tttctccact gtccgatcat cagaaagatt acaagtgttc cgtctgtggc 300 

aaatctttcc cgtcttacca agcgttaggt ggacacaaaa caagtcaccg gaaaccggtt 360 

agtgtcgatg ttaataatag taacggaacc gttactaata acggaaatat tagtaacggt 420 

ttagttggtc aaagtgggaa gactcataac tgctctatat gttttaagtc gtttccctct 480 

ggtcaagcat tgggtggtca caaacgttgt cactatgatg gtggtaacgg taacagtaac 540 

ggtgacaata gccacaagtt tgacctaaat ttaccggctg atcaagttag tgatgagaca 600 

attggaaaaa gtcaactctc cggtgaagaa acaaagtcgg tgttgtgatt attattattt 660 

tttaccgatc gggattagct agtggttgat cattagctga gtctgtaatg aaaatgat 718 

<210> 31 
<211> 215 
<212> PRT 

<213> Arabidopsis thaliana 
<400> 31 

Met Ala Leu Asp Thr Leu Asn Ser Pro Thr Ser Thr Thr Thr Thr Thr 
15 10 15 
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Ala Pro Pro Pro Phe Leu Arg Cys Leu Asp Glu Thr Glu Pro Glu Asn 
20 25 30 

Leu Glu Ser Trp Thr Lys Arg Lys Arg Thr Lys Arg His Arg lie Asp 
35 40 45 

Gin Pro Asn Pro Pro Pro Ser Glu Glu Glu Tyr Leu Ala Leu Cys Leu 
50 55 60 

Leu Met Leu Ala Arg Gly Ser Ser Asp His His Ser Pro Pro Ser Asp 
65 70 75 80 

His His Ser Leu Ser Pro Leu Ser Asp His Gin Lys Asp Tyr Lys Cys 
85 90 95 

Ser Val Cys Gly Lys Ser Phe Pro Ser Tyr Gin Ala Leu Gly Gly His 
100 105 110 

Lys Thr Ser His Arg Lys Pro Val Ser Val Asp Val Asn Asn Ser Asn 
115 120 125 

Gly Thr Val Thr Asn Asn Gly Asn lie Ser Asn Gly Leu Val Gly Gin 
130 135 140 

Ser Gly Lys Thr His Asn Cys Ser lie Cys Phe Lys Ser Phe Pro Ser 
145 ** 150 155 160 

Gly Gin Ala Leu Gly Gly His Lys Arg Cys His Tyr Asp Gly Gly Asn 
165 170 175 

Gly Asn Ser Asn Gly Asp Asn Ser His Lys Phe Asp Leu Asn Leu Pro 
180 185 190 

Ala Asp Gin Val Ser Asp Glu Thr lie Gly Lys Ser Gin Leu Ser Gly 
195 200 205 

Glu Glu Thr Lys Ser Val Leu 
210 215 

<210> 32 
<211> 702 
<212> DNA 

<213> Arabidopsis thaliana 
<400> 32 

aaattttcta tagcaatggc gcttgaagct cttaattcac caagattggt cgaggatccc 60 

ttaagattca atggcgttga gcagtggacc aaatgtaaga aacgatccaa acgttcgaga 120 

tctgatcttc atcataacca ccgtctcact gaggaagagt atctagcttt ctgtctcatg 180 

cttcttgctc gggatggcgg cgatcttgac tctgtgacgg ttgcggagaa gccgagttat 240 

aagtgtggcg tttgttacaa gacgttttcg tcttaccaag ctctcggcgg tcataaagcg 300 

agccaccgga gcttatacgg tggtggagag aatgataaat cgacaccatc caccgccgtg 360 

aaatctcacg tttgttcggt ttgcgggaaa tctttcgcca ccggtcaagc tctcggcggc 420 

cacaagcggt gccactacga tggtggcgtt tcgaactcgg aaggtgtggg gtctactagc 480 

cacgtcagca gtagtagcca ccgtggattt gaccttaata ttataccggt gcagggattt 540 

tcgccggacg acgaagtgat gagtccgatg gcgactaaga agcctcgcct gaagtaagtc 600 

tttgttgaag acctggaagt ttatcaaatg taaatatcaa atttcaattt caaggaacag 660 

ttttgttgat tctattacca atacacaata cgattcaatt cc 702 
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<210> 33 
<211> 193 
<212> PRT 

<213> Arabidopsis thaliana 
<400> 33 

Met Ala Leu Glu Ala Leu Asn Ser Pro Arg Leu Val Glu Asp Pro Leu 
15 10 15 

Arg Phe Asn Gly Val Glu Gin Trp Thr Lys Cys Lys Lys Arg Ser Lys 
20 25 30 

Arg Ser Arg Ser Asp Leu His His Asn His Arg Leu Thr Glu Glu Glu 
35 40 45 

Tyr Leu Ala Phe Cys Leu Met Leu Leu Ala Arg Asp Gly Gly Asp Leu 
50 55 60 

Asp Ser Val Thr Val Ala Glu Lys Pro Ser Tyr Lys Cys Gly Val Cys 
65 70 75 80 

Tyr Lys Thr Phe Ser Ser Tyr Gin Ala Leu Gly Gly His Lys Ala Ser 
85 90 95 

His Arg Ser Leu Tyr Gly Gly Gly Glu Asn Asp Lys Ser Thr Pro Ser 
100 105 110 

Thr Ala Val Lys Ser His Val Cys Ser Val Cys Gly Lys Ser Phe Ala 
115 120 125 

Thr Gly Gin Ala Leu Gly Gly His Lys Arg Cys His Tyr Asp Gly Gly 
130 135 140 

Val Ser Asn Ser Glu Gly Val Gly Ser Thr Ser His Val Ser Ser Ser 
145 150 155 160 

Ser His Arg Gly Phe Asp Leu Asn lie lie Pro Val Gin Gly Phe Ser 
165 170 175 

Pro Asp Asp Glu Val Met Ser Pro Met Ala Thr Lys Lys Pro Arg Leu 
180 185 190 

Lys 



<210> 34 
<211> 1157 
<212> DNA 

<213> Arabidopsis thaliana 
<400> 34 

cacacttcac tctttcttca tcttcttctt cttaaatagc tcgaaatcac atctcacaga 60 
attaaatctt atggctctcg agactctcaa ttctccaaca gctaccacca ccgctcggcc 120 
tcttctccgg tatcgtgaag aaatggagcc tgagaatctc gagcaatggg ctaaaagaaa 180 
acgaacaaaa cgtcaacgtt ttgatcacgg tcatcagaat caagaaacga acaagaacct 240 
tccttctgaa gaagagtatc tcgctctttg tctcctcatg ctcgctcgtg gctccgccgt 300 
acaatctcct cctcttcctc ctctaccgtc acgtgcgtca ccgtccgatc accgagatta 360 
caagtgtacg gtctgtggga agtccttttc gtcataccaa gccttaggtg gacacaagac 420 
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gagtcaccgg aaaccgacga acactagtat cacttccggt aaccaagaac tgtctaataa 480 
cagtcacagt aacagcggtt ccgttgttat taacgttacc gtgaacactg gtaacggtgt 540 

600 
660 



840 
900 
960 



tagtcaaagc ggaaagattc acacttgctc aatctgtttc aagtcgtttg cgtctggtca 
agccttaggt ggacacaaac ggtgtcacta tgacggtggc aacaacggta acggtaacgg 
aagtagcagc aacagcgtag aactcgtcgc tggtagtgac gtcagcgatg ttgataatga 720 
gagatggtcc gaagaaagtg cgatcggtgg ccaccgtgga tttgacctaa acttaccggc 780 
tgatcaagtc tcagtgacga cttcttaacg ttgactgagt ttgaggaaaa agtcaactat 
caagcgaaga aagggttagt ggacggtgaa gattaacggt cgtttctttc cagttgcttc 
ggtttgagct tgactgggtc tgtaatgaaa atgattggag tggacttggc attattatta 
ttatttttaa aaagaaatgt taatttgttg ttggatttgt ttatagatag aggaaacaat 1020 
tgggatacac aaatattttt tttttttaca aagaaaataa taatgcagag atggatgatt 1080 
ggatcgtaca cgttattata tagtggacca ttctgtaatc gtgaattatt attatttgtt 1140 
agaaatttaa ttttcgt 1157 

<210> 35 
<211> 245 
<212> PRT 

<213> Arabidopsis thaliana 
<400> 35 

Met Ala Leu Glu Thr Leu Asn Ser Pro Thr Ala Thr Thr Thr Ala Arg 
15 10 15 

Pro Leu Leu Arg Tyr Arg Glu Glu Met Glu Pro Glu Asn Leu Glu Gin 
20 25 30 

Trp Ala Lys Arg Lys Arg Thr Lys Arg Gin Arg Phe Asp His Gly His 
35 40 45 

Gin Asn Gin Glu Thr Asn Lys Asn Leu Pro Ser Glu Glu Glu Tyr Leu 
50 55 60 

Ala Leu Cys Leu Leu Met Leu Ala Arg Gly Ser Ala Val Gin Ser Pro 
65 70 75 80 

Pro Leu Pro Pro Leu Pro Ser Arg Ala Ser Pro Ser Asp His Arg Asp 
85 90 95 

Tyr Lys Cys Thr Val Cys Gly Lys Ser Phe Ser Ser Tyr Gin Ala Leu 
100 105 110 

Gly Gly His Lys Thr Ser His Arg Lys Pro Thr Asn Thr Ser He Thr 
115 120 125 

Ser Gly Asn Gin Glu Leu Ser Asn Asn Ser His Ser Asn Ser Gly Ser 
130 135 140 

Val Val He Asn Val Thr Val Asn Thr Gly Asn Gly Val Ser Gin Ser 
145 150 155 160 

Gly Lys He His Thr Cys Ser He Cys Phe Lys Ser Phe Ala Ser Gly 
165 170 175 

Gin Ala Leu Gly Gly His Lys Arg Cys His Tyr Asp Gly Gly Asn Asn 
180 185 190 

Gly Asn Gly Asn Gly Ser Ser Ser Asn Ser Val Glu Leu Val Ala Gly 
195 200 205 
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Ser Asp Val Ser Asp Val Asp Asn Glu Arg Trp Ser Glu Glu Ser Ala 
210 215 220 

lie Gly Gly His Arg Gly Phe Asp Leu Asn Leu Pro Ala Asp Gin Val 
225 230 235 240 

Ser Val Thr Thr Ser 
245 

<210> 36 

<211> 1213 

<212> DNA 

<213> Oryza sativa 

<400> 36 

aattcggcac gaggccacac agcaaccagc cagctgccac actagcttga ggcgagcgag 60 

cgaagcttag ctagcggata gaacaagtcg tcgatctgct tgctgctttt gtgaattgcg 120 

gtggaagcat gtcgagcgcg tcgtccatgg aagcgctcca cgccgcggtg ctcaaggagg 180 

agcagcagca gcacgaggtg gaggaggcga cggtcgtgac gagcagcagc gccacgagcg 240 

gggaggaggg cggacacctg ccccaggggt gggcgaagcg gaagcggtcg cgccgccagc 300 

gatcggagga ggagaacctc gcgctctgcc tcctcatgct cgcccgcggc ggccaccacc 360 

gcgtccaggc gccgcctccg ctctcggctt cggcgccccc gccggcaggt gcggagttca 420 

agtgctccgt ctgcggcaag tccttcagct cctaccaggc gctcggcggc cacaagacga 480 

gccaccgggt caagctgccg actccgcccg cagctcccgt cttggctccc gcccccgtcg 540 

ccgccttgct gccttccgcc gaggaccgcg agccagccac gtcatccacc gccgcgtcct 600 

ccgacggcat gaccaacaga gtccacaggt gttccatctg ccagaaggag ttccccaccg 660 

ggcaggcgct cggcgggcac aagaggaagc actacgacgg tggcgtaggc gccggcgccg 720 

gcgcatcttc aaccgagctc ctggccacgg tggccgccga gtccgaggtg ggaagctccg 780 

gcaacggcca gtccgccacc cgggcgttcg acctcaacct cccggccgtg ccggagttcg 840 

tgtggcggcc gtgctccaag ggcaagaaga tgtgggacga ggaggaggag gtccagagcc 900 

ccctcgcctt caagaagccc cggcttctca ccgcgtaatt cagcagctgc acggatccga 960 

tccgtcagag tttttgtcta gggagtgaaa ttcagtcgaa acacactatt cgttgattcg 1020 

ttttgtgccg ctattgttta atttgttcct gcttttgtac agagcaagcg agtgatacat 1080 

agccatacat acagtcatac agatataggt ctagctcttc cttggttctt tgtaacactg 1140 

gaactgtacc tgtatctttt acactttgtt ctttgacagt catatattgt agaccaaaaa 1200 

aaaaaaaaaa aaa 1213 

<210> 37 

<211> 269 

<212> PRT 

<213> Oryza sativa 

<400> 37 

Met Ser Ser Ala Ser Ser Met Glu Ala Leu His Ala Ala Val Leu Lys 
15 10 15 

Glu Glu Gin Gin Gin His Glu Val Glu Glu Ala Thr Val Val Thr Ser 
20 25 30 

Ser Ser Ala Thr Ser Gly Glu Glu Gly Gly His Leu Pro Gin Gly Trp 
35 40 45 

Ala Lys Arg Lys Arg Ser Arg Arg Gin Arg Ser Glu Glu Glu Asn Leu 
50 55 60 



Ala Leu Cys Leu Leu Met Leu Ala Arg Gly Gly His His Arg Val Gin 
65 70 75 80 
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Ala Pro Pro Pro Leu Ser Ala Ser Ala Pro Pro Pro Ala Gly Ala Glu 
85 90 95 

Phe Lys Cys Ser Val Cys Gly Lys Ser Phe Ser Ser Tyr Gin Ala Leu 
100 105 110 

Gly Gly His Lys Thr Ser His Arg Val Lys Leu Pro Thr Pro Pro Ala 
115 120 125 

Ala Pro Val "Leu Ala Pro Ala Pro Val Ala Ala Leu Leu Pro Ser Ala 
130 135 140 

Glu Asp Arg Glu Pro Ala Thr Ser Ser Thr Ala Ala Ser Ser Asp Gly 
145 150 155 160 

Met Thr Asn Arg Val His Arg Cys Ser lie Cys Gin Lys Glu Phe Pro 
165 170 175 

Thr Gly Gin Ala Leu Gly Gly His Lys Arg Lys His Tyr Asp Gly Gly 
180 185 190 

Val Gly Ala Gly Ala Gly Ala Ser Ser Thr Glu Leu Leu Ala Thr Val 
195 200 205 

Ala Ala Glu Ser Glu Val Gly Ser Ser Gly Asn Gly Gin Ser Ala Thr 
210 215 220 

Arg Ala Phe Asp Leu Asn Leu Pro Ala Val Pro Glu Phe Val Trp Arg 
225 230 235 240 

Pro Cys Ser Lys Gly Lys Lys Met Trp Asp Glu Glu Glu Glu Val Gin 
245 250 255 

Ser Pro Leu Ala Phe Lys Lys Pro Arg Leu Leu Thr Ala 
260 265 

<210> 38 
<211> 528 
<212> DNA 

<213> Arabidopsis thaliana 
<400> 38 

atgaagagag accggtccga ttacgaagaa tccatgaagc atatagacat agtagaaagt 60 
ctaatgatgt tatctcgaag tttcgtggtc aaacaaatcg atgtaaagca atctaccgga 120 
agcaaaacga accataataa ccacttcgaa tgcaaaacgt gtaaccggaa atttgattcc 180 
ttccaagctc ttggaggtca tagagctagc cacaagaaac ctaagctgat cgttgaccaa 240 
gaacaggtga agcatcgtaa caaagagaat gatatgcata agtgtacaat ttgcgatcaa 300 
atgtttggga ccggtcaagc tctaggcggt cacatgagaa agcataggac gagcatgata 360 
accgagcaat cgattgtccc ttctgtggtt tattccagac cggtttttaa tcgttgcagt 420 
agcagcaagg agatcttgga cttaaatcta actccattgg aaaatgatct tgtgttaatc 480 
tttgggaaga atttggttcc acaaattgat ttgaagtttg tgaattag 528 

<210> 39 

<211> 175 

<212> PRT 

<213> Arabidopsis thaliana 
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<400> 39 

Met Lys Arg Asp Arg Ser Asp Tyr Glu Glu Ser Met Lys His lie Asp 
15 10 15 

lie Val Glu Ser Leu Met Met Leu Ser Arg Ser Phe Val Val Lys Gin 
20 25 30 

He Asp Val Lys Gin Ser Thr Gly Ser Lys Thr Asn His Asn Asn His 
35 40 45 

Phe Glu Cys Lys Thr Cys Asn Arg Lys Phe Asp Ser Phe Gin Ala Leu 
50 55 60 

Gly Gly His Arg Ala Ser His Lys Lys Pro Lys Leu He Val Asp Gin 
65 70 75 80 

Glu Gin Val Lys His Arg Asn Lys Glu Asn Asp Met His Lys Cys Thr 
85 90 95 

He Cys Asp Gin Met Phe Gly Thr Gly Gin Ala Leu Gly Gly His Met 
100 105 110 

Arg Lys His Arg Thr Ser Met He Thr Glu Gin Ser He Val Pro Ser 
115 120 125 

Val Val Tyr Ser Arg Pro Val Phe Asn Arg Cys Ser Ser Ser Lys Glu 
130 135 140 

He Leu Asp Leu Asn Leu Thr Pro Leu Glu Asn Asp Leu Val Leu He 
145 150 155 160 

Phe Gly Lys Asn Leu Val Pro Gin He Asp Leu Lys Phe Val Asn 
165 17 0 175 



<210> 40 

<211> 820 

<212> DNA 

<213> Saccharum 



of f icinarum 



<220> 

<221> misc_feature 

<222> (406) . . (406) 

<223> n can be any nucleotide 

<220> 

<221> mis cofeature 

<222> (581) . . (582) 

<223> n can be any nucleotide 

<220> 

<221> misc_feature 

<222> (589) . . (589) 

<223> n can be any nucleotide 



<400> 40 

cctaaccagc attagctttt caaatcaaca agcctcgccg tgaccgatcg atggccatca 
cccacgacga ctacgtctcc ctctgcctca tggcgctcgc agccgcggga ggcggaggcc 



60 
120 
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aagctggttt aacaacgcag tacgctctga acacggctgc ctggacagcg acggcgcaag 180 

agtccgagct ccgcttccgg tgctccgtct gtggcaaggc cttcgcgtcg caccaggcac 240 

tgggcgggca caaggccagc caccgcaagc cgacgctcgt* acaggcacat gcgtcgtcct 300 

cagccggagg cgcggcgtcg tcgtcggtaa caatgacctc ggccgtaggc agcagtgggc 360 

aggggaggca caggtgcacg gtgtgccatc ggagcttcgc gacggngcaa gcgctcggcg 420 

ggcacaagag gtgccattac tgggacgggc tctcggtgtc gctcaccgcg tcgtcggcgc 480 

catcggggtc cgggtcgacc gtcaagggct ttgatctgaa tttggtgccg gtgccgcccg 540 

cgatggccgc caacgctgcg acaaggtggg gagaggagaa nnaagtcana aacccttggc 600 

ggtcaagaga aggcggcttg ccggtccgtc ttggacccta atttaacgat ttagaagtcc 660 

tttttttaat aattaagagt tcttttgaag aaggttgtaa agttttcgaa ccttgttctt 720 

ttaatggatt tgggtgctgg cgaaatttta aaactggatt taaatttgcg ctcactcttt 780 

ttttttattt tttacaccct tttttttttt tagaagaaga 820 

<210> 41 
<211> 1509 
<212> DNA 

<213> Arabidopsis thaliana 
<400> 41 

ttcctttctc ttcctctctc tctctcttca ccatgactga tccttattcc aatttcttca 60 

cagactggtt caagtctaat ccttttcacc attaccctaa ttcctccact aacccctctc 120 

ctcatcctct tcctcctgtt actcctccct cttccttctt cttcttccct caatccggag 180 

acctccgccg tccaccgccg ccaccaactc ctcctccttc tcctcctctc cgagaagccc 240 

tccctctcct cagcctcagc cccgccaaca aacaacaaga ccaccatcac aaccatgacc 300 

accttattca agaaccacct tcaacctcca tggatgtcga ctacgatcat caccatcaag 360 

atgatcatca taacctcgat gacgatgacc atgacgtcac cgttgctctt cacataggcc 420 

ttccaagccc tagtgctcaa gagatggcct ctttgctcat gatgtcttct tcttcctctt 480 

cctcgaggac cactcatcat cacgaggaca tgaatcacaa gaaagacctc gaccatgagt 540 

acagccacgg agctgtcgga ggaggagaag atgacgatga agattcagtc ggcggagacg 600 

gcggctgtag aatcagcaga ctcaacaagg gtcaatattg gatccctaca ccttctcaga 660 

ttctcattgg ccctactcag ttctcatgtc ctgtttgctt caaaaccttc aacagataca 720 

ataacatgca gatgcatatg tggggacatg gatcacaata cagaaaagga cctgaatctc 780 

taaggggaac acaaccaaca ggaatgctaa ggcttccgtg ctattgctgc gccccaggct 840 

gtcgcaacaa cattgaccat ccaagggcaa agcctctcaa agacttcaga acccttcaaa 900 

cacattacaa gagaaaacat gggatcaaac ctttcatgtg taggaaatgt ggaaaggctt 960 

tcgcagtccg aggggactgg agaacacatg agaagaattg tggcaaactt tggtattgca 1020 

tatgtggatc tgatttcaag cacaagagat ctctcaaaga tcacatcaag gcttttggga 1080 

atggtcatgg agcctacgga attgatgggt ttgatgaaga agatgagcct gcctctgagg 1140 

tagaacaatt agacaatgat catgagtcaa tgcagtctaa atagcttata tatattacta 12 00 

taagtactaa gtaattcggt atatatatta attataagaa acctaaatct atggaccaag 1260 

ttttgatgga ggtagggctt ttcaaactaa aagctatatc atctaattga tcataggaaa 1320 

aaaatgaatc aagagcactt ggaaaatttt aaattgtatc tttagcttcc tagttaaatt 1380 

tattgcaaga caatgtagca gtctaaccaa tgaggttccc aacggtttat ttctatttgt 1440 

atattatttt gtcattagct tcacctttcg ttaattcgaa ggacataact tataaatgtt 1500 

taaattatg 1509 

<210> 42 
<211> 383 
<212> PRT 

<213> Arabidopsis thaliana 
<400> 42 

Met Thr Asp Pro Tyr Ser Asn Phe Phe Thr Asp Trp Phe Lys Ser Asn 
1 5 10 15 



Pro Phe His His Tyr Pro Asn Ser Ser Thr Asn Pro Ser Pro His Pro 
20 "* 25 30 
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Leu Pro Pro Val Thr Pro Pro Ser Ser Phe Phe Phe Phe Pro Gin Ser 
35 40 45 

Gly Asp Leu Arg Arg Pro Pro Pro Pro Pro Thr Pro Pro Pro Ser Pro 
50 55 60 

Pro Leu Arg Glu Ala Leu Pro Leu Leu Ser Leu Ser Pro Ala Asn Lys 
65 70 75 80 

Gin Gin Asp His His His Asn His Asp His Leu lie Gin Glu Pro Pro 
85 90 95 

Ser Thr Ser Met Asp Val Asp Tyr Asp His His His Gin Asp Asp His 
100 105 110 

His Asn Leu Asp Asp Asp Asp His Asp Val Thr Val Ala Leu His lie 
115 120 125 

Gly Leu Pro Ser Pro Ser Ala Gin Glu Met Ala Ser Leu Leu Met Met 
130 135 140 

Ser Ser Ser Ser Ser Ser Ser Arg Thr Thr His His His Glu Asp Met 
145 150 155 160 

Asn His Lys Lys Asp Leu Asp His Glu Tyr Ser His Gly Ala Val Gly 
165 170 ■ 175 

Gly Gly Glu Asp Asp Asp Glu Asp Ser Val Gly Gly Asp Gly Gly Cys 
180 185 190 

Arg lie Ser Arg Leu Asn Lys Gly Gin Tyr Trp lie Pro Thr Pro Ser 
195 200 205 

Gin lie Leu lie Gly Pro Thr Gin Phe Ser Cys Pro Val Cys Phe Lys 
210 215 220 

Thr Phe Asn Arg Tyr Asn Asn Met Gin Met His Met Trp Gly His Gly 
225 230 235 240 

Ser Gin Tyr Arg Lys Gly Pro Glu Ser Leu Arg Gly Thr Gin Pro Thr 
245 250 255 

Gly Met Leu Arg Leu Pro Cys Tyr Cys Cys Ala Pro Gly Cys Arg Asn 
260 ' 265 270 

Asn lie Asp His Pro Arg Ala Lys Pro Leu Lys Asp Phe Arg Thr Leu 
275 280 285 

Gin Thr His Tyr Lys Arg Lys His Gly lie Lys Pro Phe Met Cys Arg 
290 295 300 

Lys Cys Gly Lys Ala Phe Ala Val Arg Gly Asp Trp Arg Thr His Glu 
305 " " 310 315 320 

Lys Asn Cys Gly Lys Leu Trp Tyr Cys He Cys Gly Ser Asp Phe Lys 
325 330 335 

His Lys Arg Ser Leu Lys Asp His He Lys Ala Phe Gly Asn Gly His 



WO 2004/058980 



27/33 



PCT/EP2003/051104 



340 345 350 

Gly Ala Tyr Gly lie Asp Gly Phe Asp Glu Glu Asp Glu Pro Ala Ser 
355 360 365 

Glu Val Glu Gin Leu Asp Asn Asp His Glu Ser Met Gin Ser Lys 
370 375 380 

<210> 43 
<211> 1303 
<212> DNA 

<213> Arabidopsis thaliana 
<400> 43 

atctacacac tactactcac atctcatctc tctctagcac atacccatca aaccatatag 60 

atacggtgct tttattcttg atcttcttct tcttctttgt cttctcctca gagtcatgtc 120 

taatccagct tgttcgaatc tcttcaacaa tggatgtgac cataatagct tcaactattc 180 

cacttctctc tcttacattt acaactctca cggtagctac tattactcta ataccacaaa 240 

ccctaattac attaatcata ctcataccac ttccacttcc cctaactcac ccccactaag 300 

agaagctctt cctcttctta gcttaagccc cataaggcac caagaacaac aagaccaaca 360 

ctatttcatg gacacccatc aaattagctc ttcaaacttt cttgatgatc ctcttgtgac 420 

tgtggatctt catctagggt taccaaacta cggtgttggt gagagcatta ggagcaatat 480 

tgctcctgat gcaaccacgg acgagcaaga tcaagatcat gaccgaggag tagaagtcac 540 

agttgagtcc caccttgatg atgatgatga tcatcatgga gatctacaca gaggtcatca 600 

ctattggatt cctactcctt ctcagatttt gattggtcct acacagttca cttgtcctct 660 

ttgcttcaag acattcaaca gatacaacaa catgcagatg cacatgtggg gacacggctc 720 

acaatacaga aagggaccag aatccttaag aggaacccaa ccaacaggaa tgctaagact 780 

accatgtttc tgctgtgcac ccggttgcaa gaacaacatt gaccacccac gagccaagcc 840 

tcttaaggac tttcgaaccc tccaaacaca ttacaaacgt aaacatgggt ctaaaccatt 900 

tgcttgtcgt atgtgtggta aggcctttgc agtgaaagga gattggagaa cgcatgagaa 960 

gaattgtgga aagctttggt attgctcttg tggctcggat tttaagcaca agaggtcgct 1020 

taaggaccat gtcaaggcct ttggaaatgg tcatgttcct tgtgggattg atagttttgg 10 80 

aggagatcat gaggactact atgatgctgc ttctgatatc gagcaataag atgatagcaa 1140 

caacaatgag tgttaattag gggttttgtt tatttttcct ctcatgcatt agttgattgt 1200 

atgcacgtgt tctttagttt tgttcttcgg atctttgttt tattttgttt tgagctgttt 12 60 

tttttttaat tactaagaag ttaattatca tctaaagatt ttc 1303 

<210> 44 

<211> 337 

<212> PRT 

<213> Arabidopsis thaliana 

<400> 44 

Met Ser Asn Pro Ala Cys Ser Asn Leu Phe Asn Asn Gly Cys Asp His 
1 5 10 15 

Asn Ser Phe Asn Tyr Ser Thr Ser Leu Ser Tyr lie Tyr Asn Ser His 
20 25 30 

Gly Ser Tyr Tyr Tyr Ser Asn Thr Thr Asn Pro Asn Tyr lie Asn His 
35 ' 40 45 

Thr His Thr Thr Ser Thr Ser Pro Asn Ser Pro Pro Leu Arg Glu Ala 
50 55 60 

Leu Pro Leu Leu Ser Leu Ser Pro lie Arg His Gin Glu Gin Gin Asp 
65 70 75 80 
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Gin His Tyr Phe Met Asp Thr His Gin He Ser Ser Ser Asn Phe Leu 
85 90 95 

Asp Asp Pro Leu Val Thr Val Asp Leu His Leu Gly Leu Pro Asn Tyr 
100 105 110 

Gly Val Gly Glu Ser He Arg Ser Asn He Ala Pro Asp Ala Thr Thr 
115 120 125 

Asp Glu Gin Asp Gin Asp His Asp Arg Gly Val Glu Val Thr Val Glu 
130 135 140 

Ser His Leu Asp Asp Asp Asp Asp His His Gly Asp Leu His Arg Gly 
145 150 155 160 

His His Tyr Trp lie Pro Thr Pro Ser Gin He Leu He Gly Pro Thr 
165 170 175 

Gin Phe Thr Cys Pro Leu Cys Phe Lys Thr Phe Asn Arg Tyr Asn Asn 
180 185 190 

Met Gin Met His Met Trp Gly His Gly Ser Gin Tyr Arg Lys Gly Pro 
195 200 205 

Glu Ser Leu Arg Gly Thr Gin Pro Thr Gly Met Leu Arg Leu Pro Cys 
210 215 220 

Phe Cys Cys Ala Pro Gly Cys Lys Asn Asn He Asp His Pro Arg Ala 
225 230 235 240 

Lys Pro Leu Lys Asp Phe Arg Thr Leu Gin Thr His Tyr Lys Arg Lys 
24 5 250 255 

His Gly Ser Lys Pro Phe Ala Cys Arg Met Cys Gly Lys Ala Phe Ala 
260 265 270 

Val Lys Gly Asp Trp Arg Thr His Glu Lys Asn Cys Gly Lys Leu Trp 
275 280 285 

Tyr Cys Ser Cys Gly Ser Asp Phe Lys His Lys Arg Ser Leu Lys Asp 
290 295 300 

His Val Lys Ala Phe Gly Asn Gly His Val Pro Cys Gly He Asp Ser 
305 310 315 320 

Phe Gly Gly Asp His Glu Asp Tyr Tyr Asp Ala Ala Ser Asp He Glu 
325 330 335 

Gin 



<210> 45 

<211> 495 

<212> DNA 

<213> Arabidopsis thaliana 



<400> 45 

atggttgcga gaagtgagga agttgagata gtggaagata cggcggcgaa atgtttgatg 



60 
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ttgttatcaa gagttggaga atgcggcgga ggaggagaga aacgagtttt ccgatgcaag 120 

acttgtctta aagagttttc gtcgtttcaa gctttgggag gtcatcgtgc aagccacaag 180 

aaactcatta acagtagcga tccatcactt cttggatcct tgtctaacaa gaaaactaaa 240 

acggcgacgt ctcatccttg tccgatatgt ggcgtggagt ttccgatggg gcaagctctt 300 

ggtggtcaca tgaggagaca taggagtgag aaagcctcac caggcacgtt ggttacacgt 360 

tcttttttac cggagacgac gacggtgacg actttgaaaa aatcgagtag tgggaagaga 420 

gtggcttgtt tggacttaga ttcgatggag agtttagtca attggaagtt ggagttggga 480 

agaacgattt cttga 495 

<210> 46 
<211> 164 
<212> PRT 

<213> Arabidopsis thaliana 
<400> 46 

Met Val Ala Arg Ser Glu Glu Val Glu He Val Glu Asp Thr Ala Ala 
15 10 15 

Lys Cys Leu Met Leu Leu Ser Arg Val Gly Glu Cys Gly Gly Gly Gly 
20 25 30 

Glu Lys Arg Val Phe Arg Cys Lys Thr Cys Leu Lys Glu Phe Ser Ser 
35 40 45 

Phe Gin Ala Leu Gly Gly His Arg Ala Ser His Lys Lys Leu He Asn 
50 55 60 

Ser Ser Asp Pro Ser Leu Leu Gly Ser Leu Ser Asn Lys Lys Thr Lys 
65 70 75 80 

Thr Ala Thr Ser His Pro Cys Pro He Cys Gly Val Glu Phe Pro Met 
85 90 95 

Gly Gin Ala Leu Gly Gly His Met Arg Arg His Arg Ser Glu Lys Ala 
100 105 HO 

Ser Pro Gly Thr Leu Val Thr Arg Ser Phe Leu Pro Glu Thr Thr Thr 
115 120 125 

Val Thr Thr Leu Lys Lys Ser Ser Ser Gly Lys Arg Val Ala Cys Leu 
130 135 140 

Asp Leu Asp Ser Met Glu Ser Leu Val Asn Trp Lys Leu Glu Leu Gly 
145 150 155 160 

Arg Thr lie Ser 



<210> 47 
<211> 1209 
<212> DNA 

<213> Arabidopsis thaliana 
<400> 47 

atggaagacg aacatcaaga tctccataaa cccattaatg gagctttgcg agacctcaag 60 
attactcggt cacagaaaga aacagaaaag tctacgaacc aacagcaaga tgttacttgt 120 
tactatggtc taagggaaaa ctcgaagaag aaaacccagg aatctccgga accaatgaag 180 
aagattttgt ttcgatgcga agaatgtgga aaagggtttc ggtacgagaa atattttaag 240 
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aatcatcgct cgatgatgca tttatcgccg aacgagaagg tttgtgaaga atccttgatg 300 

actctgtctc gtagccttgg gtttgtgaag aagaagaaaa gatcaagact tggtaggtct 360 

gggaagactt tatttactac gtttcttgaa ccgagttcta tttttgatgc gactgatgaa 420 

gaattagaag tggcggattg tttgattcta ttgtctaaga gtgctcccaa ggttgtagac 480 

gaattgaaaa gtctttctga ggcagtacgt gttactcctg aaacacctga aagtagctat 540 

gatttgggtt gtttgctcaa caagaaaccg agaaaaggtg gtgaattgga atctggggtt 600 

ttaagtaatg agcaaagact tatggaagaa gggtttagta gttatggaac atcgaaagaa 660 

ccagctagct tcttgagaga cgaaaacaga ttggatcagc agaaacggag aaaagatggt 720 

gaatttgaat ccggactttt gagtaatgag caaagactgc tagaagaaga gattactact 780 

cctgtgacat tcaaaggtcc agcgagttcc ttgagacaca agtgtgcttt ggatcgaaat 840 

ggaggtgaat ttggtcctga gtttttgagt aatgagcaaa cactgatgga agaaacatgg 900 

aaagaaccag tgagtttctt agaagataag catgaatttg atcagcggaa aatgcgagaa 960 

gctggcgact ttgaatctag gttttacaga attgagcttg gagtaggagc tatggagtgt 1020 

acttcttcag atactgatat gctcacgcaa tctgataaga agaacgttga gcatcgatgc 1080 

aggttgtgca acaagatatt ctcgtcttat caagctctag ggggtcatca gacgtttcat 1140 

cggatgagca aatgtaagaa caagaagaat ggcatagagg aatcagttga acccaggatg 1200 

actctgtga 1209 

<210> 48 
<211> 402 
<212> PRT 

<213> Arabidopsis thaliana 
<400> 48 

Met Glu Asp Glu His Gin Asp Leu His Lys Pro lie Asn Gly Ala Leu 
15 10 15 

Arg Asp Leu Lys lie Thr Arg Ser Gin Lys Glu Thr Glu Lys Ser Thr 
20 25 30 

Asn Gin Gin Gin Asp Val Thr Cys Tyr Tyr Gly Leu Arg Glu Asn Ser 
35 40 45 

Lys Lys Lys Thr Gin Glu Ser Pro Glu Pro Met Lys Lys lie Leu Phe 
50 55 60 

Arg Cys Glu Glu Cys Gly Lys Gly Phe Arg Tyr Glu Lys Tyr Phe Lys 
65 70 75 80 

Asn His Arg Ser Met Met His Leu Ser Pro Asn Glu Lys Val Cys Glu 
85 90 95 

Glu Ser Leu Met Thr Leu Ser Arg Ser Leu Gly Phe Val Lys Lys Lys 
100 105 110 

Lys Arg Ser Arg Leu Gly Arg Ser Gly Lys Thr Leu Phe Thr Thr Phe 
115 120 125 

Leu Glu Pro Ser Ser lie Phe Asp Ala Thr Asp Glu Glu Leu Glu Val 
130 135 140 

Ala Asp Cys Leu lie Leu Leu Ser Lys Ser Ala Pro Lys Val Val Asp 
145 150 155 160 

Glu Leu Lys Ser Leu Ser Glu Ala Val Arg Val Thr Pro Glu Thr Pro 
165 170 175 

Glu Ser Ser Tyr Asp Leu Gly Cys Leu Leu Asn Lys Lys Pro Arg Lys 
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180 185 190 

Gly Gly Glu Leu Glu Ser Gly Val Leu Ser Asn Glu Gin Arg Leu Met 
195 200 205 

Glu Glu Gly Phe Ser Ser Tyr Gly Thr Ser Lys Glu Pro Ala Ser Phe 
210 215 220 

Leu Arg Asp Glu Asn Arg Leu Asp Gin Gin Lys Arg Arg Lys Asp Gly 
225 ^ 230 235 240 

Glu Phe Glu Ser Gly Leu Leu Ser Asn Glu Gin Arg Leu Leu Glu Glu 
245 250 255 

Glu lie Thr Thr Pro Val Thr Phe Lys Gly Pro Ala Ser Ser Leu Arg 
260 265 270 

His Lys Cys Ala Leu Asp Arg Asn Gly Gly Glu Phe Gly Pro Glu Phe 
275 280 285 

Leu Ser Asn Glu Gin Thr Leu Met Glu Glu Thr Trp Lys Glu Pro Val 
290 295 300 

Ser Phe Leu Glu Asp Lys His Glu Phe Asp Gin Arg Lys Met Arg Glu 
305 310 315 320 

Ala Gly Asp Phe Glu Ser Arg Phe Tyr Arg lie Glu Leu Gly Val Gly 
325 330 335 

Ala Met Glu Cys Thr Ser Ser Asp Thr Asp Met Leu Thr Gin Ser Asp 
340 345 350 

Lys Lys Asn Val Glu His Arg Cys Arg Leu Cys Asn Lys lie Phe Ser 
355 360 365 

Ser Tyr Gin Ala Leu Gly Gly His Gin Thr Phe His Arg Met Ser Lys 
370 375 380 

Cys Lys Asn Lys Lys Asn Gly lie Glu Glu Ser Val Glu Pro Arg Met 
385 390 395 400 

Thr Leu 



<210> 49 
<211> 1087 
<212> DNA 

<213> Arabidopsis thaliana 
<400> 49 

cttgttagtt cactccacat aataaacacc aaagatttca ttctcttctc cataatttcg 60 
aagtttcttg aattgggttt gtttcttgat ttgtttcttg aattgggttt tggtcttctt 120 
ttcttactat atttggatat gatgatgggt caagatgagg ttgggagtga tcagacgcaa 180 
atcataaaag ggaaacgtac gaagcgacaa agatcgtctt cgacgtttgt ggtgacggcg 240 
gcgacaacag tgacttcaac aagttcatcg gccggtggaa gtggaggaga aagagctgtt 300 
tcagatgaat acaactcggc ggtttcgtct ccggtgacta ctgattgtac gcaagaagaa 360 
gaagacatgg cgatttgtct catcatgtta gctcgtggga cagttctt cc atcgccggat 420 
ctcaagaact cgagaaaaat tcatcagaag atttcgtcgg agaattctag tttctatgtg 480 
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tacgagtgta aaacgtgtaa ccggacgttt tcgtcgttcc aagcacttgg tggacacaga 540 

gcgagccaca agaagccgag gacgtcgact gaggaaaaga ctagactacc cctgacgcaa 600 

cccaagtcta gtgcatcaga agaagggcaa aacagtcatt tcaaagtttc cggctcagcc 660 

ctagcttcac aggcaagtaa catcatcaac aaggcaaaca aagtacacga gtgttccatc 720 

tgcggttctg agttcacttc cgggcaagct ctcggtggtc acatgaggcg gcacaggaca 780 

gccgtaacca cgattagccc cgttgcagcc accgcagaag taagcagaaa cagtacagag 840 

gaagagattg agatcaatat aggccgttcg atggaacagc agaggaaata tctaccgttg 900 

gatcttaatc taccagcacc aggagatgat ctaagagagt ccaagtttca agggatagta 960 

ttctcagcaa caccagcgtt aatagattgt cattactagt tgtttttttt actacataat 1020 

atgatgaaat atttgtgaat tcttcttact tactactata ttgttgatca aaaaaaaaaa 1080 

aaaaaaa 1087 

<210> 50 
<211> 284 
<212> PRT 

<213> Arabidopsis thaliana 
<400> 50 

Met Gly Gin Asp Glu Val Gly Ser Asp Gin Thr Gin lie He Lys Gly 
15 10 15 

Lys Arg Thr Lys Arg Gin Arg Ser Ser Ser Thr Phe Val Val Thr Ala 
2 0 25 30 

Ala Thr Thr Val Thr Ser Thr Ser Ser Ser Ala Gly Gly Ser Gly Gly 
35 40 45 

Glu Arg Ala Val Ser Asp Glu Tyr Asn Ser Ala Val Ser Ser Pro Val 
50 55 60 

Thr Thr Asp Cys Thr Gin Glu Glu Glu Asp Met Ala He Cys Leu He 
65 70 75 80 

Met Leu Ala Arg Gly Thr Val Leu Pro Ser Pro Asp Leu Lys Asn Ser 
85 90 95 

Arg Lys He His Gin Lys He Ser Ser Glu Asn Ser Ser Phe Tyr Val 
100 105 110 

Tyr Glu Cys Lys Thr Cys Asn Arg Thr Phe Ser Ser Phe Gin Ala Leu 
115 120 125 

Gly Gly His Arg Ala Ser His Lys Lys Pro Arg Thr Ser Thr Glu Glu 
130 135 140 

Lys Thr Arg Leu Pro Leu Thr Gin Pro Lys Ser Ser Ala Ser Glu Glu 
145 ~ 150 155 160 

Gly Gin Asn Ser His Phe Lys Val Ser Gly Ser Ala Leu Ala Ser Gin 
165 170 175 

Ala Ser Asn He He Asn Lys Ala Asn Lys Val His Glu Cys Ser lie 
180 185 190 

Cys Gly Ser Glu Phe Thr Ser Gly Gin Ala Leu Gly Gly His Met Arg 
195 200 205 

Arg His Arg Thr Ala Val Thr Thr He Ser Pro Val Ala Ala Thr Ala 
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210 

Glu Val 
225 

Arg Ser 
Pro Ala 
Phe Ser 



Ser Arg 

Met Glu 

Pro Gly 
260 

Ala Thr 
275 



Asn Ser 
230 

Gin Gin 
245 

Asp Asp 
Pro Ala 



215 

Thr Glu 

Arg Lys 

Leu Arg 

Leu lie 
280 



Glu Glu 



Tyr Leu 
250 

Glu Ser 
265 

Asp Cys 



220 

He Glu 
235 

Pro Leu 
Lys Phe 
His Tyr 



He Asn He Gly 
240 

Asp Leu Asn Leu 

255 

Gin Gly lie Val 
270 



